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ON CHERN’S INVARIANT FOR RIEMANNIAN 4-MANIFOLDS 
HANS SAMELSON 


1. Introduction. S. S. Chern has defined, guided by geometric 
considerations, a differential geometric invariant for closed, ori- 
entable, 4-dimensional Riemannian manifolds [1];! the invariant is 
closely connected with the fact that the orthogonal group in 4 vari- 
ables is essentially, that is, up to a finite covering, the direct product 
of two orthogonal groups in 3 variables. It is the purpose of this note 
to exhibit a space for which this invariant is not zero; at the same 
time we shall verify the well known generalized Gauss-Bonnet 
formula for the curvatura integra. The space is the complex projec- 
tive plane K®, in its elliptic metric; this metric is discussed in detail 
in [2], and we shall freely refer to this paper for the necessary formulae 
of Riemannian geometry in Cartan’s formulation; for the basic 
definitions of differential forms we refer to [3]. We make use of the 
well known fact that K? is the base space of a fiber decomposition 
of S* with S' as fiber (see [4] and [5]). 


2. The space K*. We now consider the complex plane K?. As is 
well known, it is obtained from the 5-sphere S® by a fiber decomposi- 
tion [4]: We represent S® as unit sphere >| z:| 2=1 in a complex- 
euclidean 3-dimensional space C* with coordinates (2), 22, 23). Two 
points y and z of S® are called equivalent, under the relation 9, if there 
exists a real 6 such that y,=2,-e*; the equivalence classes, or fibers, 
are great circles, with @ as arc length. 

One finds that the decomposition space S*/O with the induced 
topology is a 4-dimensional manifold; it is called K?, and with the 
canonical map ¢: S*—K? the system (S*, S', K?, @) is a fiber bundle 
[5] (with the rotations of S! as structure group). In addition there is 
a natural way of making K? into a differentiable (even analytic) 
manifold, such that @ is a differentiable (analytic) map: if 2° 
=(2, 2, 4) with, say, 20, is a point of S*, then the real and 
imaginary part of 2/2, and 23/2; serve as analytic coordinates in the 
neighborhood of ¢(z’). Now S* is a Riemannian manifold, as subspace 
of C*, with fundamental form >| dz; 2. One verifies after some com- 
putation that for a vector = (dz, dze, dz3), tangent to S® at (21, 22, 23), 
the expression dz; 2_| > 2 depends only on the image 
vector of £ under the map @, and that it represents a positive definite 


Received by the editors June 1, 1949. 
1 Numbers in brackets refer to the references cited at the end of the paper. 
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(analytic) quadratic form on the tangent space to K?; the Riemannian 
geometry determined by it is called the elliptic hermitian geometry of 
K?; from now on we consider K* with this metric (see [2]). Any 
vector of S*', which is perpendicular to the fiber through its point of 
support, is mapped by ¢ into a vector of the same length. Using this 
fact, and considering the volume forms (g*dx,---dx,) of the 
Riemannian manifolds S*, K?, and S', one can show that the volumes 
v(S*), v(K?), and v(S') satisfy the relation v(.S*) =v(K?) -v(S'); since 
v(S') =27, and v(S*)=2* [6, p. 303], we have 
(*) o(K?) = 

3. The differential forms. Chern’s invariant and the total curvature 
form are expressed in terms of Cartan’s differential forms [1]. We 
first consider the complex curvature forms (2;; for K*; we take the 


necessary formulae from [2]: there are two complex 1-forms w,; and 
and we, and the Q2;;’s are given by 


03; = + > Q;; = W i j. 
We have to set and 


The volume form dv is then given by 0,6.y:2, and we note, in conse- 
quence of (*), §2, that 


f = f dy = 
kK? K2 2 


Substituting into the Q;,;’s we have 
Dis = (0; + — i) + DO Oe + (Or — tn) 
+ Vidi, 
Q = (0; + — Wi) = 0; + + — t j. 


ll 


Setting 0;;=0;;+7¥;;, and denoting the real curvature forms by 
Ys, x, 8 =0, 1, 2, 3, we obtain for the matrix y" = (Qs) by [2, (44) ] the 
following: 


-¥ 0 6201 + 
Qr = ( ) with © = ( ) 
Vv 6:02 + 0 


eon + W102 — ) 
— + 2101) 


With the quantities 23, we now compute the Chern form I, and 


and 


| 
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the total curvature form K; they are given by T'=)vecs [%,]? and 
K = (see [1, p. 970]). By a straightforward 


multiplication we obtain 
r= 240, = — and K = 2406 = 24d>. 
Incidentally this shows that T really is a form of K? and not only of 


a fiber bundle over K?. 
We obtain now for the characteristic, using (**), 


P 1 6 6 x 
f K= dy = —-— = 3, 
47? K? K? 2 


which checks with well known facts, and for the Chern invariant 


1 12 
7=—: [ w--6, 
2x? K? 


which is different from 0; as predicted by the theory, the value of 7 
is integral. The two invariants J, and I; of [1] have the values 6 
and —18 for 
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FINITELY-VALUED COHOMOLOGY GROUPS 
JOHN W. KEESEE! 


In this note it will be shown that finitely-valued functions are 
adequate to determine the Alexander-Kolmogoroff cohomology 
groups of a compact Hausdorff space. This is effected by exhibiting 
a homomorphism of the finitely-valued groups into the general 
groups which is an isomorphism onto if the space is compact. We 
assume familiarity with Spanier’s [2]? development of the Alexander- 
Kolmogoroff groups although the definitions are repeated here. 

Let X be an arbitrary topological space and G a fixed abelian 
group. Denote by ®9(X) the group of all functions (-functions) 
from Xt! (the Cartesian product of X with itself +1 times) to G. 
Let { N(AX?+) } represent the open sets containing the diagonal of 
X?*!, Then ©§(X) is the subgroup of ©?(X) consisting of those p- 
functions which are zero on some N(AX?+"). There is a homo- 
morphism 6 of 6?(X) into 6°+1(X) defined by the formula 


t=0 
This homomorphism has the property that 55=0. The subgroup 
is called the group of p-cocycles, and 3(X), the 
group of p-coboundaries, is defined to be 567-1(X) +4?(X). The pth 
cohomology group of the space X, denoted by H?(X), is the quotient 
group 
For each f: XY (f not necessarily continuous) there is defined 
a homomorphism f*: Y)-?(X) by 


Std(xo, Xp) (f(o), f(x), »S(%p)), = 


If f is a map (=continuous function), f* carries each of the subgroups 
$5(Y), and ®2(Y) into the corresponding subgroup of 6?(X). 
Let A be a subset of X and 1:A-—>X the identity map. Then 


=i 8A), 2X, A) = 
@p(X, A) = 56” (X, A) + A), 
@2(X, A) = &°(X, A) 82(X). 


Presented to the Society, April 2, 1949; received by the editors June 3, 1949. 
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The pth cohomology group of X mod A is H*(X, A) 
A)/®5(X, A). 

If f is a function on X to Y that carries a subset A of X into a 
subset B of Y, f is said to be on (X, A) to (Y, B). We write 


f: (X, (Y, B). 


If f:(X, A)->(Y, B) is a map, f* carries °(Y, B), 2(Y, B), and 
$3(Y, B) into 69(X, A), &3(X, A), and ®3(X, A) respectively. 

For a fixed pair of functions f and g on (X, A) to (Y, B) thereisa 
function D: B)—-?(X, A) defined by Do(xo, x1, - - - , xp) 
= g(a), fxs), , f(x»). Then for 
p>0 and B), If p=0, 
= 

The purpose of this paper is to consider the groups H?(X, A) de- 
fined as follows. Let F?(X) be the subgroup of ©7(X) consisting of 
those #-functions which have only a finite number of elements of G 
as values. Define F?(X, A) =@?(X, A)(\ F(X), Fi(X, A) A) 
(\ F(X), F3(X, A) A)+ A) (when p=0, F2(X, A) 
=0), F2(X, A) =82(X, A)C\F*(X). Then H7(X, A) is the quotient 
group F2(X, A)/F3(X, A). It will be shown that for an arbitrary 
space X, there is a homomorphism r:H}?(X, A)—H?(X, A) which is 
an isomorphism onto if X is a compact Hausdorff space and A is 
closed. 


Lemma 1. Let M and N be abelian groups with subgroups My and 
No and X a homomorphism of M into N such that }(Mo)CNo. Let 
a:M—M/M, and 8B: N—N/N, be the natural homomorphisms. Then 
there is a homomorphism p:M/M defined by pa(x) =B8X(x) 
for xGM. Furthermore, the kernel of p is }~'(No)/Mo and the image 
of M/ Mo under p is (A(M)+No)/No. 


The method of proof is evident. 

The results of Lemma 1 are used to define the homomorphism r 
as follows: Denote by y:®2(X, A)—H?(X, A) and v: A) 
—H?(X, A) the natural homomorphisms and let o: A) 
—7(X, A) be the identity homomorphism. Define r:H}?(X, A) 
—H?(X, A) by = for FZ(X, A). It follows that 7 is a 
homomorphism, the kernel of r is ®3(X, A)(\F?(X)/FR(X, A), and 
the image of H?(X, A) under is F?(X, A) +3(X, A)/®3(X, A). 

The next two lemmas are concerned with properties of subsets of 
G that satisfy the following two conditions: (1) If a subset H of G 
has property M, then (—H) has M. (2) If Hi and H have M, then 
(Hi+H:) has M. For such a property M, define M?(X, A) to be all 
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@€(X, A) such that ¢ carries some N(AX?t') into a set H with 
property M. 
In what follows, if U is a covering of X, N,(V) will denote the set 


U[U?| VEV]. 


LEMMA 2. Hypothesis: (a) X and Y are topological spaces, f and g 
are functions on (X, A) to (Y, B); 

(c) VY is a covering of Y such that 6=0 on Nyis(V)OB?**', and 56 
carries Nyi2(¥V) into a subset N of G with property M; 

(d) U és a covering of X such that for each UCU, there isa VEV 
for which f(U)\Ug(U)C V. 

Conclusion: For p>0O, A)+M*(X, A); for 
p=0, M*(X, A). 

ProoF. It is sufficient to show (i) if p20, M?(X, A) and (ii) 


if p>0, A). First consider (xo, - , 
Then Dig(xo, , %p)= - - , g(x), 
(\B?*? so Dio =0 on and therefore A). 
Also for (xo, - - , Ddp(xo, Xp) iN. 
But this set has property M since N has and Di@€ M?(X, A). 
Second, for p>0O, take (x0, -- +, Then (g(xo), 


Xp-1) = 0. So Dp EC A). This completes the proof of the lemma. 


Lemma 3. If X is a compact Hausdorff space and A 1s closed, then 
A) C 667-1(X, A) + M(X, A)+ F(X, A). 


Proor. If ¢€®(X, A)(\5-!M?+1(X, A), then there exists a cover- 
ing ¥ of X for which ¢=0 on and carries 
into a subset of G with property M. Let VU be a finite star refinement 
of Y [1, p. 324]. Write U=[Ui, Us, - - - , Un]. To each U; for which 
assign To each U; such that 
assign #,;CU;. Define f:(X, A)—(X, A) by f(x)=%; where 7 
is the least integer for which xCU;. Then f(X) is a finite 
subset of X and F(X). Also A), for consider 
(x0, °°, Then x,CU, (i=0,---, p) and 
there exists Vo€¥V such that Vo) the star of U;. Then f(U;)C the 
star of U; implies f(x;)€ the star of U;. Hence (f(x), f(x), ---, 
f(xp)) EV2*", and - - - , Xp) =G(f(x0), - - , f(xp)) =0. There- 
fore A). Let i:(X, A)—>(X, A) be the identity function; 
then for each UCU, there exists VEY such that 7(U)\Uf(U) CV. 
Now by Lemma 2, A) _M?(X, A) or 


| 
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€ 66>-1(X, A) + M?(X, A) + F°(X, A). 


For the next lemma define L?(X) to be the group of p-functions 
which assume only a finite number of values on some N(AX?*"), and 
let L?(X, A) = L?9(X)(\?(X, A). It follows that L?(X, A) = F?(X, A) 
+2(X, A) and 6L°(X, A) Ch A). 


Lemma 4. If X ts a compact Hausdorff space and A is closed, then 
A) CO3(X, A)+ F(X, A). 


Proor. Since the property of being finite is a property of the type 
of Lemma 2, it follows by Lemma 3 that 


A) (\5L**1(X, A) C A) + L?(X, A). 
But A)+L9(X, A)+2(X, A)+ F(X, A) 
= @3(X, A)+F*(X, A). 
In the proof of the following theorem, repeated use is made of the 


so-called Dedekind property of groups, that is, if A, B, and C are 
subgroups of a group G, and A is contained in C, then 


(A+ B) OAC = (ANC) + (BNC). 
THEOREM. If X is a compact Hausdorff space and A ts closed, then 


T 4s an isomorphism onto. 


Proor. (a) Show r is an isomorphism. The kernel of r is ®3(X, A) 
(\ / F(X, A). 
Now 


5(X, A) = A) + 40(X, A)] F'(X) 

= [5a A) + 60(X, A)] AL(X) NF*(X) 

= “(xX, A) L(X, A) + A)] OF? (X). 
By Lemma 4 A)L*(X, A) A)+6F?-(X, A). 
Therefore, A)+62(X, A)]OF*(X). 
But [5Fe—"(X, A)+42(X, A) =5F?-(X, A)+2(X, A) 
(\F?(X) = FR(X, A). This shows that the kernel of r is 0. 

(b) Show 7 is onto. The image of H?(X, A) under 7 is FZ(X, A) 
+3(X, A)/®3(X, A). In Lemma 3 define “HW has M” to mean 
Then M?t!(X, A) =@$t!(X, A) and 

2(X, A) C 58” A) + A) + A) 
= A) + F(X, A). 


But since ®3(X, A)C®3(X, A), it follows that ®23(X, A) C®3(X, A) 


) 1950] 421 

| 
| 

| 

) 

| 

| 


422 H. J. RYSER [August 


+ F(X, A). The reverse inclusion is clear and the equality shows that 
T is onto. 
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A NOTE ON A COMBINATORIAL PROBLEM 
H. J. RYSER 
The purpose of this note is to prove the following theorem. 


THEOREM. Let x1, X2,-+-,X, denote v distinct elements and let 
Ti, Tz, «++, T, denote v sets formed from these elemenis. Suppose 
further that each T; consists of exactly k distinct elements and that every 
pair of distinct sets T; and T; have exactly \ elements in common 
(0<A<k<v). Then 


k(k — 1 
v-—1 


The combinatorial problem under consideration makes its appear- 
ance in the study of projective planes, Hadamard matrices, and 
block designs. In the combinatorial problem of Todd arising in the 
study of Hadamard matrices it is usually assumed that A 
=k(k—1)/(v—1) [3; 4].! For the symmetrical block designs each 
element of the arrangement is required to occur exactly k times, and 
it is then easy to verify that \=k(k—1)/(v—1) [1]. Further theorems 
concerning the possibility of this combinatorial problem for a given 
v, k, and \ may be found in [3]. 

To prove the theorem let the elements x, - - - , x, be listed in a 
row, and let the sets 7, - - - , T, be listed in a column. Form the 
incidence matrix A of the arrangement in the usual way by inserting 
a one in column #7 and row j if x; belongs to set 7, and a zero in the 
contrary case [2; 3]. Now let s; denote the sum of column 7 of the 


Presented to the Society, September 2, 1949; received by the editors June 9, 1949. 
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matrix A. Let rowr of A have ones in the k columns7;<re< - 


and let t=A(v—1)+k. Then by the hypothesis of the theorem it 
follows that 


The matrix of the above system of v equations in v unknowns is 
simply A. Moreover, it is clear that 


Sstot---+s,— kv =0. 


Now /#0, and consequently the matrix 


1 
C=| A 

1---1 


has a zero determinant. Multiplying C by its transpose implies that 


AF1---AF1 
A+1 k+1 

lata 


where r=k+kv/s and s=v+k*v?/t?. Multiplying row v+1 by —r/s 
and adding to each of the other rows gives s-det D=0, where the 
matrix D has k+1-—r?/s in the main diagonal and \+1—7?/s in all 
other positions. An evaluation of the determinant of the matrix D 
implies that 


=0, 
and consequently 
= 0. 
Since 


ry + 0)? 


= ? 
kv 


it is clear that 


(t + 0)(2 + = + 


ia 

= 
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Thus and t=k?. 


COROLLARY. Every element in the arrangement described in the pre- 
ceding theorem occurs exactly k times and every pair of elements occurs 
exactly times. 


The corollary is an immediate consequence of the preceding 
theorem and Theorem 1 derived in [3]. 

A simple and direct alternative proof of the corollary may be ob- 
tained by noting that for \=k(k—1)/(v—1), the equations s,;+ -- - 
+s,=kv and s;(s1—1)+ - - - +5,(s,—1) =Av(v—1) imply that s;=k 
(see [1]). The writer is indebted to Professor Marshall Hall, Jr., for 
this suggestion. 
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LINEAR REPRESENTATIONS OF TOPOLOGICAL GROUPS 
MORIKUNI GOTO 


Introduction. In his recent papers entitled Faithful representations 
of Lie groups, I and II,! the author has called a locally compact con- 
nected group L faithfully representable (f.r.) if L admits a faithful 
representation,” and has determined the characteristic properties of 
f.r. groups. It is to be pointed out that although an f.r. group is a 
Lie group, a connected Lie group is not always f.r.* Furthermore, he 
introduced there a notion of (J)-groups, and proved that a Lie group 
is f.r. if it is an (l)-group,* where the definition of (l)-groups runs as 
follows: 

DEFINITION 1. Let G be a locally compact connected group. Then G 
1s called an (l)-group if for every element g of G distinct from the identity 
there exists a representation of G under which g does not go over into the 
unit matrix. 

The above definition is obviously equivalent to the one of imposing 
on G the condition that G can be approximated by f.r. groups, namely 
that there exists a system {Na} of closed invariant subgroups of G 
so that every factor group G/N. is f.r. and the intersection of all 
N.’s coincides with the identity group {e}, which is composed of the 
identity e only. 

In the present paper we shall study the structure of (/)-groups as 
a sequel of [M.G.]. Now K. Iwasawa called a locally compact 
group G an (L)-group if G can be approximated by Lie groups; he 
studied the structure of (L)-groups, and in particular solved the so- 
called fifth problem of Hilbert for such groups.' Our (/)-group is of 
course a connected (L)-group and our study is based naturally on the 
theory of (L)-groups also. 

Next we shall state the outline of the present paper. In §1 we study 
semi-simple (L)-groups, where the semi-simpleness of an (L)-group is 
defined in an analogous way as in the theory of Lie groups. §2 is 


Received by the editors May 3, 1949. 
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devoted to the proof of Theorem 2, which has an essential bearing on 
the latter part of this paper. Now in §3 we deal with the class of 
completely -reducible (l)-groups, which contains both compact con- 
nected groups and semi-simple (/)-groups. After such preparations 
we shall investigate in §4 the structure of general (/)-groups. Theorem 
5, as well as Theorem 6, is our fundamental result, affording a char- 
acterization of (/)-groups. Lastly in §5 some supplementary facts 
about solvable (/)-groups are stated. 

In closing the introduction, the author wishes to express his sincere 
gratitude to Mr. Y. Matsushima for his valuable suggestions and co- 
operation. 


1. Semi-simple (L)-groups. Let G be a locally compact group. G 
contains then the radical, that is, the uniquely determined maximal 
connected solvable invariant subgroup, which is closed in G in virtue 
of [K.I.]. Now we shall extend the notion of semi-simpleness of Lie 
groups to (L)-groups, as follows: 

DEFINITION 2. An (L)-group is called semi-simple if its radical coin- 
cides with the identity group. 

Since the radical is characteristic, any closed invariant subgroup of 
a semi-simple (Z)-group is also semi-simple. 

Now in order to study the structure of semi-simple (L)-groups we 
first give the following two lemmas. 


Lema 1. Let G be a locally compact connected group, and let C and N 
be closed invariant subgroups of G such that G=CN. Denote by C° and 
N° the connected components of C and N containing the identity respec- 
tively. Suppose C, or C°, be compact, then we have G=C°N®. 


Proor. As C° is compact, C®°N is a closed invariant subgroup. Since 
the intersection of C°N and C contains C°, the isomorphism® G/C°N 
=C/C°N(\C implies that G/C°N is 0-dimensional. On the other 
hand, G/C°N must be connected because of the connectedness of G. 
Hence we get G= C°N. Now a similar argument as above would estab- 
lish the lemma. Q.E.D. 


LemMA 2. Let G be a connected semi-simple (L)-group, and K a 
compact 0-dimensional invariant subgroup of G. Suppose that the factor 
group G/K is a Lie group. Then G ttself is a Lie group. 


ProoF. It is clearly sufficient to prove that K cannot be infinite 
and separable. Suppose that K be an infinite separable group. Then 


§ By an isomorphism (or a homomorphism) we mean a continuous open isomorphism 
(or homomorphism). 
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there exists a sequence { K,}: 


Xn Katy, 


of open subgroups of K such that NK,={e}. Since K is a central 
invariant subgroup and any open subgroup of a topological group 
is closed, K,’s are all closed invariant subgroups. 

We denote by G the universal covering group of G/K. Then we 
can readily select a sequence {Z,} of central invariant subgroups of 


G: 
Zn 


such that G/Z,~%G/K,, n=1, 2, - - -. Denote by Z the intersection 
of all Z,. Then it is easy to construct an algebraic isomorphism be- 
tween G/Z and G. Since K is central, K is a countable group. On the 
other hand, it is easy to prove that a countable compact group is 
discrete. Hence K must be finite, contrary to the hypothesis. This 
completes our proof. 

Now let G be a connected semi-simple (L)-group and let C* be 
the (uniquely determined) maximal compact invariant subgroup’ 
of G. Then by a theorem of Iwasawa in [K.I.], G contains a closed 
invariant subgroup L* of G such that G=C*L* and [C*, L*]= {e} ; 
where we denote by [C*, L*] the subgroup generated by the elements 
of the form c—J-'cl with cE C* and/ CE L*. Let Cand Lbe the connected 
components of C* and L* respectively containing the identity. Then, 
using Lemma 1, we obtain that G=CL. 

Let now K be the maximal compact invariant subgroup of L, and 
K° the connected component of K containing e. C being the maximal 
connected compact invariant subgroup, K® must be contained in 
the intersection of C and L. Hence K° is central. This shows that K® 
coincides with the identity group, whence K is 0-dimensional. There- 
fore, using Lemma 2, we get the fact that L is a Lie group. Now de- 
note by D the intersection of C and L. Since D is compact and is con- 
tained in the center of L, D must be a finite group. 

Next let {C,} be a sequence of compact connected simply con- 
nected semi-simple simple Lie groups, and C the direct product of all 
Cy’s: C=[]C,. C is also a compact group with respect to the so-called 
weak topology. Now let D be a closed 0-dimensional invariant sub- 
group of C. Then the factor group C/D is obviously semi-simple, and 
conversely any connected compact semi-simple group is readily 


7 The existence, as well as the uniqueness, of such a subgroup for any connected 
(L)-group has been assured in [K.I.]. 
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proved to be obtained in such a way.*® 

In a similar way, we can establish a method of constructing all con- 
nected semi-simple (L)-groups: Let {5,} be a sequence of connected 
simply connected semi-simple simple Lie groups such that almost 
all of 5,’s are compact, and Jet § be the direct product of all 5,’s: 
S= []5,. Then S$ is clearly a connected semi-simple (L)-group with 
respect to the weak topology. Now let D be a closed 0-dimensional 
invariant subgroup of 5. 5/D is clearly semi-simple, and any con- 
nected semi-simple (L)-group is isomorphic with some |]5S,/D for 
suitable {5,} and D. 

Now a factor group of the above []5, is clearly semi-simple. 
Hence any factor group of a connected semi-simple (L)-group is also 
semi-simple. Thus we get the following theorem. 


THEOREM 1. (1) Let G be a connected semi-simple (L)-group, and C 
the maximal compact connected invariant subgroup of G. Then there 
exists a closed connected invariant Lie subgroup® L of G so that G=CL, 
and |C, L|= fe} , and that the intersection of C and L is a finite group. 

(2) Any factor group of G is semi-simple. 

Using the latter part of Theorem 1, we can easily prove the follow- 
ing corollary: 


CoroLuary 1. Let G be a connected (L)-group and R the radical of 
G. If N is a closed invariant subgroup of G such that RN is closed, then 
RN/N is the radical of G/N. 


COROLLARY 2. Let G be a connected group and S a closed connected 
invariant subgroup. We denote by S* the centralizer'® of S. If S is a 
semi-simple (L)-group, then we have G=SS*. 


Proor. Corollary 2 has been proved in [K.I.] for any compact 
invariant subgroup S. It is also clear for a connected semi-simple 
Lie group S. Now it is not hard to establish the corollary from these 
special cases. Q.E.D. 


2. Proof of Theorem 2. In this section we shall give a proof of the 
following theorem. 


THEOREM 2. Let G be an (l)-group and K a compact invariant sub- 
group. Then the factor group G/K is also an (1)-group. 


§ Cf. H. Freudenthal, Topologische Gruppen mit geniigend vielen fastperiodischen 
Funktionen, Ann. of Math. vol. 37 (1936). 

® By a (closed) Lie subgroup we mean a subgroup which is a Lie group. 

0 Namely, the (invariant) subgroup composed of all elements which commute 
with every element of S. 
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In order to prove Theorem 2, we first recollect and modify some 
results stated in [M.G. ]. 

Let G be an f.r. group, and N a maximal closed simply connected 
solvable invariant subgroup of G. Then there exists a closed subgroup 
T, whose radical is compact, such that 


G=TN, TON= {e}. 


Conversely a connected Lie group is f.r. if it admits such a decomposi- 
tion.'! 

Now the radical of an f.r. group is compact if and only if it is com- 
pletely reducible,“ where the complete reducibility of an ()- 
group is defined as follows: 


DEFINITION 3. An (1)-group G is called completely reducible if every 
representation of G is completely reducible. 


Let T be a completely reducible f.r. group and A the radical of T. 
Then A is compact and there exists in T a closed connected semi- 
simple subgroup S such that T=SA. It is clear that [S, A]={e} 
and the intersection of S and A is a finite group. Now any factor 
group of a semi-simple f.r. group is also f.r.,"* and a connected Lie 
group is f.r. if its radical and a maximal semi-simple subgroup” are 
f.r.45 These obviously imply that a factor group of a completely re- 
ducible f.r. group is also a completely reducible f.r. group. 

Next, it is to be remarked that a simply connected solvable Lie 
group is homeomorphic with the Euclidean space, and that a locally 
compact connected solvable group is a simply connected Lie group if 
and only if it contains no compact subgroup other than the identity 
group. 

Now the following lemma is a special case of Theorem 2. 


Lemma 3. Let G be an f.r. group and K a compact invariant subgroup. 
Then the factor group G/K ts also f.r. 


Proor. Let G=TWN be a decomposition as above, and let M be a 
maximal compact subgroup"? of T. Since N is homeomorphic with 


1 [M.G.II, Corollary 2 to Theorem 7]. 

12 (M.G.II, Lemma 14]. 

13 [M.G.II, Lemma 9]. 

4 By a maximal semi-simple subgroup of a Lie group we mean a connected group 
generated by a closed maximal semi-simple local subgroup. 

6 [M.G.II, Theorem 7]. A. Malcev, On linear Lie groups, C.R. (Doklady) Acad. 
Sci. URSS. (1943). 

16 See [K.I.]. 

17 See [K.I.]. 
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the Euclidean space, M is maximal compact also in G. Hence K is 
contained in M, and thus we have K CT. Now since T is completely 
reducible, the factor group 7/K is a completely reducible f.r. group. 
It is now clear that G/K is f.r., because 


G/K = T/K-(NK)/K 


is also a decomposition as above. Q.E.D. 

PROOF OF THEOREM 2. Let g be an element of G which is not con- 
tained in K. For any element x of K we can readily select a representa- 
tion f, of G and a neighborhood U, of x such that f,(U.g) does not 


contain the unit matrix. Now there exists a finite system 1, -,%n} 
of elements of K such that the system { Uz, °*+, Uz,§ of neighbor- 
hoods covers K: UU.;)K. 

Now let f be the so-called sum representation of f.,, - - - , fz,, and 
let Ni, No, - - - , Na, N be the kernel of f:,, f2,, - - , fz,, f. From the 
relations U.z.g/\N;=0 (i=1, 2,---,m), and NN;=N, we have 


U.2(\N=0, where 0 denotes the empty set. Accordingly we obtain 
Kg(\N=0, whence 


Kgl\ KN = 0. 


Now since G/N is f.r. and KN/N is a compact invariant subgroup 
of G/N, G/KN is also f.r. by Lemma 3. Now the relation Kg(/\KN=0 
implies that Kg/K is not contained in the kernel of the homo- 
morphism G/K~G/KN:G/K/KN/K=G/KN, and this completes 
our proof. 


CoROLLARY. Let G be an (i)-group, and U a neighborhood of the 
identity in G. Then there exists a compact invariant subgroup K, which 
ts contained in U, such that the factor group G/K is fr. 


Proor. Since G is a connected (L)-group, there exists a compact 
invariant subgroup K such that G/K is a Lie group and KCU, in 


virtue of [K.I.]. Thus the corollary is an immediate consequence of 
Theorem 2. Q.E.D. 


3. Completely reducible (/)-groups. In [M.G. II]'* we have de- 
termined the class of all semi-simple f.r. groups. Here we consider 
semi-simple (/)-groups. 


THEOREM 3. (1) Let G be a semi-simple (L)-group, and let 
G=CL 


18 Lemma 9, Theorem 3, and Theorem 4. 
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be the decomposition of G as in Theorem 1. Then G is an (1)-group if and 
only if L is f.r. 

(2) The center of a semi-simple (1l)-group is compact. 

(3) Any factor group of a semi-simple (l)-group ts also an (l)-group. 


Proor. Assume that L be f.r. Then the direct product CXL is 
clearly an (/)-group. Now we can readily find a finite central invari- 
ant subgroup D of CXL such that CXL/D=G. Hence G is an (I)- 
group by Theorem 2. The converse is trivial. 

Next, we know that thecenter of a semi-simple f.r. group is finite.'® 
From this the second assertion follows immediately. 

Further, let G be a semi-simple (/)-group and N a closed invariant 
subgroup of G. We denote by N® the connected component of N 
containing the identity. N° is clearly a semi-simple (/)-group. Let C® 
be the maximal connected compact invariant subgroup of N° and let 


N® = C°L® 


be the decomposition of N° as in Theorem 1. C° is obviously an in- 
variant subgroup of G, and the compactness of C® implies that G/C® 
is an (/)-group by Theorem 2. Let now G/C® = C*L* be the decomposi- 
tion of G/C® as in Theorem 1. Then the invariant subgroup N°/C® 
is clearly contained in L*. The fact that a factor group of a semi- 
simple f.r. group is f.r. readily implies that G/ N° is also an (/)-group. 
Hence the center of G/N® is compact by our above assertion. This 
shows the compactness of N/N®, and we obtain the last assertion, 
using Theorem 2 again. Q.E.D. 

It is well known that a semi-simple f.r. group is completely re- 
ducible, and so is a semi-simple (/)-group. Now, we study general 
completely reducible (/)-groups. A criterion for the complete re- 
ducibility is furnished by the following lemma. 


LemMa 4. An (l)-group is completely reducible if and only if its 
radical is compact. 


Proor. Let G be a completely reducible (/)-group. According to 
the corollary of Theorem 2, there exists a compact invariant sub- 
group K such that G/K is f.r. Denote by R the radical of G. Then 
RK/K is the radical of G/K, in virtue of Corollary 1 to Theorem 1. 
Now since G/K is obviously completely reducible, its radical RK/K 
is compact. Hence R must be compact. 

Conversely, let G be an (/)-group with the compact radical R, and 
let N be an arbitrary closed invariant subgroup of G such that the 


19 [M.G.I, Lemma 5]. 
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factor group G/N is f.r. It is sufficient to show that G/N is com- 
pletely reducible, or what is the same, that the radical of G/N is 
compact. Now the compactness of R implies that RN/N is the com- 
pact radical, and this completes our proof. 

Let G be a topological group. We denote by d(G) the commutator 
group of G, that is, [G, G], and by D(G) the closure of d(G). Following 
Iwasawa we call D(G) the topological commutator group of G. The 
topological commutator group of a connected group is obviously 
connected. 

Next, let G be a completely reducible (/)-group and R the radical 
of G. Since R is compact, D(G)R/D(G) is the radical of the commuta- 
tive G/D(G). Hence D(G)R coincides with G: G= D(G)R. Now let K 
be a compact invariant subgroup of G. Since it is clear that d(G)K 
= D(G)K, we have 


D(G)K/K = D(G/R). 


Let Z be the intersection of D(G) and R. In any representation of G 
with a compact kernel, Z goes over into the intersection of the com- 
mutator group and the radical, which is obviously a finite group. 
Hence Z must be 0-dimensional. That D(G) is semi-simple follows 
immediately. 

Now it is easy to prove the following theorem. 


THEOREM 4. (1) Let G be a completely reducible (1)-group, and C the 
maximal connected compact invariant subgroup. G contains a closed 
connected semi-simple f.r. invariant subgroup L such that G=CL, 
[C, L]= fe}, and the intersection of C and L is a finite group. And 
conversely a topological group which admits such a decomposition is a 
completely reducible (1)-group. 

(2) Any solvable invariant subgroup of a completely reducible (l)- 
group is contained in the center, which is compact. 

(3) Any factor group of a completely reducible (l)-group is also a 
completely reducible (l)-group. 


REMARK. It is well known that a locally compact connected group 
is maximally almost periodic if and only if it is decomposable into a 
direct product of a compact group and a vector group.”° On the other 
hand, it is clear that a vector group is not completely reducible. 
Hence, or from Theorem 4, we get the following proposition, by 
which the position of compact groups in the theory of topological 
groups is clarified: 


20 H. Freudenthal, loc. cit. 
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A locally compact connected group is a completely reducible, maxi- 
mally almost periodic group if and only tf it is compact. 


4. Fundamental theorems. 


THEOREM 5. Let G be a locally compact connected group. If G is an 
(1)-group, then there exists a closed completely reducible (l)-group T in 


G and a closed simply connected solvable Lie invariant subgroup N such 
that 


G=TN, TON= {e}, 
and conversely. 


Proor. Let G be an (/)-group. In virtue of the corollary to Theorem 
2, we can select a sequence {Ka} of compact invariant subgroups of 
G such that G. =G/K, is f.r. and the intersection of all K,’s coincides 
with the identity group. Now, because of the compactness of Ka, 
there corresponds D(G,) to D(G) in the homomorphism G~Gzg. 

We denote by the radical of D(G). Since is compact, MK ./Ka 
is the radical of D(G,). On the other hand we know that the radical 
of the commutator group of an f.r. group is simply connected. 
Hence M cannot contain a compact subgroup other than the identity 
group. Therefore M is a simply connected Lie group. 

Now, since D(G) is an (/)-group by itself, D(G) contains a compact 
invariant subgroup K such that the factor group D(G)* = D(G)/K is 
f.r. Now since MN\K= fe}, and MK is closed, M*=MK/K, which 
is isomorphic with M, is the radical of D(G)*. Let S* be a maximal 
semi-simple subgroup of D(G)*. The center of S* is finite and S* is 
closed. Hence the intersection of S* and M* is the identity group. 
Thus we obtain the decomposition 


D(G)* = S*M*, M*= {é}, 
where e* denotes the identity of D(G)*. 
Now let S be the complete inverse image of S* in the homo- 


morphism D(G)~D(G)*. Then S is obviously semi-simple and we 
get 


D(G) = SM. 


Since any solvable invariant subgroup of S is easily seen to be com- 
pact, we have 


S\M = {e}. 


2 See [M.G.I, Lemma 7]. 
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Next, since G/D(G) is connected and commutative, using the 
structure theorem of Pontrjagin, we can assure the existence of a 
closed invariant subgroup H containing D(G) such that G/H is 
compact and H/D(G) is a vector group. 

We denote by WN the radical of H. Now we select a compact in- 
variant subgroup K, of H so that H/K;, is f.r. Ki is obviously con- 
tained in D(G). Hence we can consider K, as K mentioned above 
and retain the above notations. Let N* be the image of N in the 
homomorphism H~H*=H/K. N* is clearly the radical of H*. The 
fact that H*/D(G)* is commutative immediately implies that any 
maximal semi-simple subgroup of H™* is contained in D(G)*. Hence 
S*, as a maximal semi-simple subgroup of D(G)*, is also a maximal 
semi-simple subgroup in H*. Thus we have H* = S*N*. Hence we get 
that 

H = SN. 


Now M* is obviously locally coincident with D(G)*/\N*. On the 
other hand from the relation 


H*/D(G)* & N*/D(G)* A N*, 


N*/D(G)*C\N is a vector group. This is impossible unless D(G)*/\N* 
is connected. Hence we have D(G)*(\N*=M*. Thus the simple 
connectedness of M* and that of N*/M* imply that of N*. 

Next we shall prove that N is a simply connected Lie group iso- 
morphic with N*. First, since the center of S is compact, S(\N must 
be a compact 0-dimensional group. Hence K/\N is also compact 
0-dimensional. Now suppose that K(\N# {e}. Then there exists a 
proper open (closed) subgroup Q of K/\N. Then by the relation 
N/K.\N=N*, N/Q is a proper covering group of N*. This is clearly 
impossible, because N* is simply connected. Hence K(\N= {e}. 
Thus our assertion holds immediately, and moreover we get 


SON = {e}. 


Next we put G/N=G’, H/N=H’. Then G’ is an (L)-group, and 
H’ is a closed connected semi-simple invariant subgroup of G’ be- 
cause H’=S. Hence by Corollary 2 to Theorem 1 there exists a closed 
invariant subgroup Aj/ of G’ so that G’=H’Ai and [H’, Ai | 
= {e’}, where e’ is the identity of G’. The center of H’ coincides with 
H'(\Aj, and is compact because H’ is a semi-simple (/)-group. On 
the other hand, the relation G/H=G’'/H'=SAi/H'(\Ai implies 
that Aj /H’(\Aj is compact. Hence Aj is a compact group. Let now 
A’ be the connected component of Aj containing e’. Then Lemma 1 
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implies that G’=H’A’. 
Next we denote by R the complete inverse image of A’ in the 
homomorphism G~G’. R is clearly the radical of G, and we get 


G = HR= SR. 


Let now G~(G) be a homomorphism of G onto an f.r. group (G) such 
that the kernel is compact, and let (R), (VV) be the image of R, N 
respectively in the homomorphism. It is clear that (R) is the radical 
of (G) and (JV) a simply connected Lie group such that (R)/(N) is 
compact. Hence (JV) is a maximal closed simply connected solvable 
invariant subgroup of (G). Now since (G) is f.r., (G) contains a 
closed completely reducible f.r. group (TJ) such that 


G =(T\(N), (TVA) = {@}. 


Let T be the complete inverse image of (T) in G. The fact that N 


contains no compact subgroup other than {e} readily implies the 
following relations 


G=TN, TON= {e}. 


Thus we obtain in particular the connectedness of T. Now it is easy 
to prove that the radical of T is compact. Hence T is completely 
reducible by Lemma 4. Thus our first assertion is proved. 

Conversely, let G be a locally compact connected group which 
admits a decomposition G=TN, TI\N= fe}, as above. Then, since 
G/N=ST and T is an (/)-group, there exists a system {Na} of closed 
invariant subgroups containing N such that NN.z=N and G/N,’s 
are f.r. 

Now let us denote by 7; the set of all elements of T which are com- 
mutative with every element of N. Then 7; is clearly a closed in- 
variant subgroup of G such that T/T; is f.r. Now in virtue of Theorem 
4, T/T, is completely reducible. Hence the relations 


G/T, = T/T,-NT,/T,, NT,/T, = T,/T, 
NT,/T:= N, 


imply that G/T; is f.r. Thus because of the relation NN.(\Ti= {e} we 
get the result. Q.E.D. 


CoROLLARY. Let G be an (l)-group and K a maximal compact sub- 
group of G. Then there exists a closed connected simply connected solvable 
Lie subgroup H of G such that 


G=HK=KH, K0\H= {e}. 
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ProorF. In virtue of Theorem 5, it is clearly sufficient to consider the 
case when G is completely reducible. On the other hand, the corollary 
holds for f.r. groups by [M.G.II].22 Thus by Theorem 4 we can 
readily complete our proof. Q.E.D. 


THEOREM 6. Let G be a locally compact connected group and R the 
radical of G. If Ris an (l)-group and G contains a closed semi-simple 
(1)-group S such that 

G = SR, 


then G is an (l)-group, and conversely. 


ProoF. In the first place we suppose that R is an (/)-group and S$ 
is a closed semi-simple (/)-group in G such that G=SR. We can 
prove that G is an (L)-group because G/R is obviously an (L)-group.*8 
Let K be a compact invariant subgroup of G such that G/K is a Lie 
group. Then SK/K is clearly a maximal semi-simple subgroup, and 
RK/K the radical, of G/K. Since they are f.r. by Theorem 2, G/K 
itself is also f.r. in virtue of [M.G.II]. This means that G is an 
(/)-group. 

Conversely, let G be an (/)-group and G=T7WN a decomposition as 
in Theorem 5. Let D(T) be the topological commutator group of T, 
and A the radical of T. Then D(T) is semi-simple and we get T 
=D(T)A by Theorem 4. Now it is clear that AN coincides with the 
radical R. Hence we have G=D(T)R, and this completes our proof. 


5. Solvable (/)-groups. Let G be a locally compact connected solv- 
able group. Then G is an (L)-group in virtue of [K.I.]. Hence there 
exists a compact invariant subgroup K such that G*=G/K is a Lie 
group. Let now D(G) be the topological commutator group of G. 
Then because of the compactness of K we have 


D(G*) = D(G)*(= D(G)K/K). 
Now it is well known that D(G)* is nilpotent. On the other hand K 
is a central invariant subgroup by [K.I.]. Hence D(G)K is nilpotent, 


and so is D(G). Thus the topological commutator group of a locally com- 
pact connected solvable group is nilpotent.™ 


# [M.G.II, Corollary 1 to Theorem 7]. 

%3 Let G be a locally compact connected group, and N aclosed connected invari- 
ant subgroup of G. Suppose N and G/N are both (L)-groups. Then G is an (L)-group 
by itself. See [K.I.]. 

* We can prove mureover the following theorem: The radical of the topological 
commutator group of a connected (L)-group is nilpotent. This is an extension of a theorem 
of E. Cartan. Cf. E. Cartan, Thése, Paris, 1894. 
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Now we shall prove the following lemma. 


Lemma 5. Let G be a connected group and N a closed connected in- 
variant subgroup. Let K be a compact subgroup of N. If N is locally 
compact and nilpotent, then K is central in G. 


Proor. Let K, be the maximal compact invariant subgroup of N. 
Then K, is commutative and N/K, is a Lie group. On the other hand, 
we know that any compact subgroup of a connected nilpotent Lie 
group is a central invariant subgroup.** Hence we can easily prove 
that K is contained in K,. 

Now it is clear that K, is an invariant subgroup of G. Then because 
of the connectedness of G, K; must be central.** Hence K;, is a central 
invariant subgroup of G. Q.E.D. 

It is now easy to establish the following theorem. 


THEOREM 7. Let G be a locally compact connected solvable group and A 
a maximal compact subgroup of G. The following conditions are all 
necessary and sufficient for G to be an (l)-group. (1) G=AN, AI\N 
={e}, where N is a closed simply connected Lie invariant subgroup. 
(2) D(G) is a simply connected Lie group. (3) The center of D(G) is a 
connected simply connected Lie group. (4) A(\D(G) = {e}. 


From Lemma 5 and Theorem 7 we have in particular the following 
corollary. 


COROLLARY. A locally compact connected nilpotent group is an (l)- 
group tf and only if it is a direct product of a compact group and a 
connected simply connected Lie group. 


NaGoyA UNIVERSITY 


% [M.G.II, Lemma 13]. 
% See [K.I.]. 
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ON A CONJECTURE ON SIMPLE GROUPS 


I. N. HERSTEIN 


The purpose of this paper is to rephrase a conjecture about simple 


groups into the language of linear algebra. 

Let G be a group of finite order 0(G). Then by I, we shall mean the 
group ring of G over a field of characteristic p (for instance the integers 
modulo p). We shall denote the radical of T, by N,. If p=0 or 
plo(G), then it is known that N,=(0); and if o(G), N,#(0). 

We now consider the following two assertions: 

(A) If Gis a simple group of odd order, o(G) is a prime. 

(B) If G ts a group of odd order o(G), then for some prime p, p\ 0(G), 
we can find a g¥1, such that 

The theorem which we propose to prove is: 


THEOREoM. (A) ts equivalent to (B). 


1. (B) implies (A). 
Derinition. U,= {gE€G|g—1EN,}. 


Lemna 1. U, ts a normal p-subgroup of G. 


ProoF. (a) U, is a subgroup of G, for if gi, geG Uy, then since N, is 
a left-ideal of T,, gi(ge—1)+(g1—1) 

(b) U, is normal, for if g—1€N,, since N, is a two-sided ideal of 
A(g—1)h-' =hgh-!—-1€EN, for all hEG. 

(c) If g—1€N,, then for some integer S, (g—1)" =0=g”"—1. So 
if g&U,, g is of order p* for some s. So U, is a p-group. 


COROLLARY. (B) implies (A). 


Proor. By (B), U,;#1 for some p| 0(G). Hence since G is simple, 
and since U, is a normal subgroup of G, U,=G. Thus G is of order p’, 
and G being simple, s=1. Hence (B) implies (A). 

2. (A) implies (B). 


LEMMA 2. (A) implies that every group of odd order is solvable. 


Proor. Let G=Gi)G2D - - - DG,=1 be a composition series for 
G. Since the G,;/G;,; are simple and of odd order, by (A) they must 
be of prime order; hence the lemma is proved. 


Since a solvable group contains a normal p-subgroup [1, p. 25, 
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Theorem 20],! we immediately obtain, using Lemma 2, the following 
lemma. 


Lemma 3. (A) implies that if G is of odd order, then it contains a 
normal p-subgroup. 


For groups of certain orders the N, can be completely described. 
This is true for p-groups. If G is of order p*, then for every gGG, 
g—1CN, [2, p. 176, Theorem 1.2 or 3, p. 239]. For our case it is 
sufficient to use the weaker result: 


Lemma 4. If G is of order p*, then for some g¥1 in G, g—1CN,. 


ProorF. Since G is of order *, it has a nontrivial center C. Let g¥1 
be in C. Then since g—1 is in the center of I',, and since (g—1)” 
=g"’—1=1—1=0, g—-1EN,. 

Suppose that S is the normal p-subgroup of Lemma 3. An element 
of the form g—1€T, is in N, if and only if for every irreducible rep- 
resentation @ of G, (g) =1. Clifford’s theorem [4, p. 534, Theorem 1] 
reduces the irreducible representations of G to irreducible representa- 
tions (or into ones fully reducible into irreducible components) of S. 
For the g of Lemma 4 in S, for every irreducible representation ¢ of 
S, 6(g) =1. So by Clifford’s theorem, for every irreducible representa- 
tion ¢ of G, ¢(g) =1. Thus g—1€N,. And so we have shown the fol- 
lowing lemma. 


Lemna 5. (A) implies (B). 
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ON A SEMI-GROUP OF SUBSETS OF A LINEAR SPACE 
R. E. FULLERTON 


A family 7 of subsets of a space is defined to be a semi-group of 
subsets provided that the intersection of any two subsets of F is 
again a subset of 7. In giving a characterization of a Banach space of 
continuous functions in terms of the geometry of the space, Clarkson! 
considered a cone € which had the property that the family of all of 
its translates formed a semi-group. This note is concerned with an 
investigation of the structure of a subset S of a linear space ¥ such 
that the family of all translates of S form a semi-group, that is, for 
any points x, y of ¥ there exists a z©X such that (x+S)(\(y+S) 
=z+S. It will be shown that under certain rather weak restrictions, 
S must be a convex cone. 

Let ¥ be an abstract linear space. The only topology which will be 
assumed for ¥ is that which arises from the topology of the straight 
lines of that is, if u,=a,x+,y, x¥~y, +8,=1, a, a, B,—8, 
then u,—ax+ By. This will be called the linear topology of ¥. We shall 
assume that all straight lines of ¥ are complete in this topology. A 
point x is an extreme point of a set S if x<S and there exists no line 
segment with end points in S which contains x in its interior. The 
theorem to be proved is the following: 


THEOREM. Let S be a subset of ¥ which is closed in the linear topology 
and which has at least one extreme point. Let the family of all trans- 
lates of S form a semi-group. Then S is a convex cone, that is, S is convex 
and there exists a pointvCS such that 


S=Ely=0+dz, €5S,d2 0). 


The theorem is proved by four lemmas. 
1. Jf x+S=S, then x=0. 


ProoF. It is evident that any translate of S can be used initially 
instead of S. Thus we may assume that S contains @ without loss of 
generality. If x+S=S, «ES. Also S=S—x and —x€ES. If yES, 
xt+tyES and —x+yES. However, if x*—«x this shows that y is 
the midpoint of a line segment joining two points of S and con- 
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tradicts the hypothesis that S has an extreme point. 
LEMMA 2. S contains only one extreme point. 


Proor. Suppose that S contained at least two extreme points. It 
may be assumed that one is the point 0. Let the second be 7+8. 
Consider the translation S+v. Since @ and v are both extreme points, 
2v€S and S+v does not contain S. Hence (S+v)(\S=y+S where 
y*6 and y is an extreme point of y+ S. Also 


AS]} OS 


y+(yt+S) = 2y+4+S. 


This shows that v, y+v, and 2y+v are in S and y+S contrary to the 
conclusion that y is an extreme point of y+. 


Lemma 3. If v is the extreme point of S, S is a cone with vertex v. 


Proor. It may be assumed that v=@. It must be shown that if 
x€S; then Ax€S for all AZO. Since Ax is a complete ray, if there 
exist values of \ for which Ax€S, there must exist a smallest value 
Xo such that because of the linear closure of S. (Age +S)OS 
=y+S where y is the extreme point of y+.S. However, since Aox 
€E(Aox+S)OS, Aox is an extreme point of y+ S and since there is 
only one extreme point of the set, y=Agx by Lemma 1. Then Agx 
+SCS and 2A9xCS. By induction it can be seen that if Aox CS, then 
nox CS for any positive integer m. Also there exists a z©S such that 
+ S)VS=2+S. This implies the existence of a point 
such that z = u+2-"Xox. Also, by the first part of the proof, ~+AgxE S. 
Since z is the extreme point of s+.S, u+(n/2)AoxE2z+SCS for every 
integer » by the same argument as was used in the first part of the 
proof. Then S=S—z contains all points of the form u+(m/2)Aox 
—(u+2—Xox). In particular, S contains the point 2—'Aox, contrary 
to the assumption that A» was the smallest nonzero value of \ for 
which AxC SS. This shows that if AxCS for all X20 and Sisa 
cone with vertex @. 

It is to be noted that if @ is the vertex, the above proof shows that 
if then AxCS for all non-negative \ and Ax+.SCS. Hence, if 
yES, y+rAxES for all X20. This shows that if S contains a ray 


through any point, it contains a parallel ray through every point of 
the set. 


Lemna 4. S is convex. 
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Proor. It must be shown that if x and y are in S, then ax+SyES 
for a, B20; a+8=1. If S has vertex @ and if y=)x, this is true by 
Lemma 3. If y#¥Ax, then the rays Ax, vy, A20, v=0, are distinct 
and are in S. If for any point ax+ fy, a, 8>0, a+8=1, there exists a 
such that po(ax+Py) CS, then p(ax+8y)ES for all by 
Lemma 3. S contains the ray y+Ax where \20. If the existence of 
positive numbers yw and A can be demonstrated such that u(ax+ fy) 
=y+ Ax, the lemma is proved. If this is so, (ua—d)x+(u8—1)y=6 
and if ~=1/8, \=a/8, the rays u(ax+6y) and y+Ax have a common 
point. Since all points of the form y+ Ax are in S, (1/8) (ax+By)ES 
and hence fy) for all »=0, in particular for Hence S 
is convex. 

By making use of this theorem it is possible to weaken the hy- 


potheses of Clarkson’s theorem slightly and to state it in the following 
form. 


THEOREM. A necessary and sufficient condition that a Banach space 
be equivalent to the space of all continuous functions over a bicompact 
Hausdorff space H is that there exist a closed set S with ai extreme point, 
containing 0 in its interior, such that all translates of S form a semi-group 
and such that the unit sphere in the space is the set S(\(—S). 
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NOTE ON THE ARITHMETIC OF 
BILINEAR TRANSFORMATIONS 


DONALD M. ADELMAN 


1. Introduction. Since the time of Lucas many papers have ap- 
peared investigating the properties of sequences of rational integers 
satisfying linear recurrence relations. Very little,! however, has been 
done in the nonlinear cases, although the generalization to variable 
coefficients seems to have awakened more interest than the general- 
ization to relations of higher degree but with fixed coefficients. The 
present paper deals with one of the simplest problems of the latter 
type. 

Let a sequence be defined by the value x; and by the relation: 


(1) Xn+1 + b)/(cxn + d); 


ad—bc#0; a, b, c, d, rational integers. Under what conditions are all 
the x; integral? That such integral sequences exist may be shown by 
two examples. First, if x1 =2 generates the sequence 


2, 2, 2,---+; second, if gen- 
erates the sequence 4, —26, 34, 19, 14, 10, 4,---.If we write 
Yn=CX, +d, e=a+d, m=bc—ad, (1) becomes: 

(2) Yui = e+ m/ Yn 


where {y;} is integral if {x;} is. 


THEOREM 1. Every sequence of integers satisfying (1) or (2) is 
periodic. 


Proor. In (2) every y, must be one of the finitely many divisors 
of m. The periodicity of {y,;} entails that of {x;}. Moreover, the 
period begins with the first term of the sequence, since each value of 
X, permits only one value for x,-1. 


2. Classification by fixed points. The possible sequences satisfying 
(2) are related to the roots, 7; and re, of r?—er—m=0. 


THEOREM 2. Jf the roots are real and of unequal magnitude there are 
no sequences satisfying (2) or just two such sequences, each of period 
one. In the latter case r,; and re are integral and generate the two se- 
quences. 


Received by the editors May 7, 1949. 
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[August 
Proor. Suppose | rs| < | re| and write (2) as: 
(3) — T1 Yn — 12 
vi=n only if 7; is integral and the sequence is 7, 71, 71, - - - . Other- 


wise lim,..(7:/72)"=0, and lima =0, which 
is only possible if ya41=72. 

If the roots are real and equal except for sign, e=0 so that we may 
take y; as any divisor of m and the sequence becomes: 


If r:=r2=r, (3) is of no use and instead we define a sequence, 
Po, Pi, by po= 1, pPi=, Then: 
LEMMA 1. yn=Pn/Pn-1- 
PROOF. Set p,/Pn1=Qn- Then g:=y1, and the recurrence relation 
becomes: 
(4) Qn+29n+1Pn = €Qn+1Pn + MPn. 


Now, from the theory of second order recurrence sequences we know 
that the general solution for p, is p,=(An+B)r", where A and B 
are not both zero since pp = 1. Evidently p,=0 for at most one value 
n=t. For all values of m except #, (4) then reduces to 


= E+ 


Qn+1 

which is the same relation satisfied by the y; without exception. The 
g-sequence and the y-sequence therefore agree in their first +1 terms. 
Also, in both, the ¢+ 2nd term is equal to e, in the y-sequence because 
Ye41 is infinite, in the g-sequence by setting p;=0 in the second order 
recurrence relation and using the definition of g:,2. Therefore the two 
sequences agree in all their terms and the lemma is proved. 

Thus it is required that ((An+B)/(A(m—1)+B))r=y, should be 
integral for all values of m. But, if A=0, y,=r, and otherwise 


limn+« Yn=?, so that again the only possible sequence occurs when r 
is integral. This gives: 


THEOREM 3. If r:=1r2=r, there is just one sequence satisfying (2) or 
none according as r is or is not an integer. In the former case the se- 
quence is r, 7,7, 


Inasmuch as we have already seen an example of a sequence of 
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period six, we may expect the case of complex roots to be more 
difficult. For real roots, of course, it is hardly necessary to use (2) 
since (1) and the value of e give all the required information. 


3. Complex roots. Again we consider the linear recurrence relation: 
(5) Pati CPn + 


but this time define a sequence {p;} by the initial values po=0, 
pi=1. This is called the principal sequence, and its first seven terms 
are 0, 1, e, e*-+m, e?+2em, e*+3e?m+m?, Using this 
sequence, we obtain the following result: 


THeEoreM 4. If {y;} satisfies (2) and {p;} is the principal solution 
of (5), then 


Pn-191 + MPn—2 


Yn = 
The proof is by induction. 


THEOREM 5. The sequence {y;} satisfying (2) has a period w if and 
only if p»=0 


Proor. Suppose ~,=0. Then by (5), Pwis=mpv1 and from 
Theorem 4, yoi1=y1. On the other hand, suppose yui1=y1. Then: 


+ = Pwiidi + 
— (Puri — MPwo-1) — = 0 = — Mpw, 
pul — ey, — m) = 0, 


2 
1 


Pwy 


and since the factor in parentheses cannot be zero, p.=0. 


Of course the period of {y;} is the smallest value of w>0 for which 
Pw=0. 


Lemma 2.2 If {p;} is the principal solution of (5), for n>O pa is a 
homogeneous polynomial of degree [(n—1)/2] in the variables e* and m 
if n 1s odd, and e times such a polynomial if n is even. In both cases all 


the coefficients are integers and the coefficient of the highest power of e is 
unity. 


? The proof beginning here is not the one that most readily suggests itself. If p, 
is expressed explicitly in terms of the scalar roots, namely, as p,=(r;—1;)/(r1—12), 
the problem of finding its zeros reduces to that of determining the exponent of an 


element in a quadratic field. But the proof given above relies on simpler and some- 
what more relevant ideas. 
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The proof is by induction. 

It follows from Theorem 5 that if (2) is satisfied by some integral 
sequence {y;} with period w, the values of e and m from (2) make the 
homogeneous polynomial of Lemma 2 zero. Consequently e?/m is the 
root of a polynomial with integral coefficients and leading coefficient 
unity. Therefore, since the root is rational it must be integral and we 
have: 


LEMMA 3. p.=0 only if e?+km=0 for some integer, k. 
Not all values of k correspond to a w however, as is seen from: 


THEOREM 6. If r; and rz are complex, and if }y;} satisfies (2) with 
period w, then e?+-km=0 for one of four values of k, k=0,1, 2, or 3, 
corresponding respectively to w=2, 3, 4, or 6. 


Proor. Since the roots are complex, e?+4m <0 and m <0. There- 
fore e?+km>0 for k<0, and e?+km <0 for k=4. Only four values of 
k remain, and we obtain the corresponding periods from the factoriza- 
tion of the first seven terms of the sequence {;}. 


4. The complete period. We now know that any sequence { y} of 
integers satisfying (2) must be periodic, and conditions on e and m in 
(2)—or on a, b, c, and d in (1)—for periodicity. But these conditions 
do not guarantee that for an arbitrary value of yi, or even for any 
value of y;, the sequences {xs} and { ys} should be entirely integral. 
We establish certain preliminary theorems. 


TuHoreM 7. If w is the period of a sequence { y;} of integers satisfy- 
ing (2), then + Yo)? =(—m)”. 

PRooF. 7;=e+m/r; for i=1 and 2. Subtracting both these equa- 
tions from (2) and multiplying the results, we obtain: 


2 


— 71)(Ynti — 72) = (yn — 11)(Yn — 12) 


2 
T172Yn 


or, if Q(y) =y*—ey—m, 


Q(yn+1) = Q(yn)- 
Yn 


Multiplying these equations over a complete period we obtain the 
result, since Q(y) £0 for y real. 


Coroiary 1. If an integer s divides each term of {yi}, then s*|m 
and s\e. 
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Proor. From Theorem 7, s*|(—m)* or s?|m. Then, from Theorem 
6, sle. 


Coroiiary 2. If {x;} is a sequence of integers satisfying (1) and if 
s=(c, d), then s|a and s| b. Accordingly we may assume (c, d) =1. 


Proor. Every term of the sequence {y;} derived from {x;} is 
divisble by s. By Corollary 1, s|e=a+d, and therefore s| a. Writing 
(1) as 


Xn41(CXn + d) = ax, + 


we have s| b. 
DEFINITION. Let R(g, h) be the greatest divisor of g prime to h. 
Then, for example, R(6, 9) =2; R(9, 6) =1. 


THEOREM 8. An integral sequence {x;} satisfying (1) corresponds to 
an integral sequence |+;} satisfying (2) if and only if c is a divisor of 


Proor. If we begin with the sequence {x;}, then y:=cx,+d, 
y2=cx2+d, and c| (vi1—y2). By Corollary 2 we may assume that c is 
prime to d, consequently to each y;, and in particular to y:. 

Start, however, from a sequence {y,} and let c be any divisor of 
R(yi—y2, yi). Choose an arbitrary integer x, and define d by y1=cx 
+d. Then, since (c, y:)=1, (c, d)=1. Next, define a by means of 
e=a-+d. Now, since c| (v1 —2), =y2=d (mod c), and writing (2) as 


(6) Yn+1yn + ™m, 


we have d?=ed+m (mod c) by taking »=1, or, d(d—e)=m=—ad 
(mod c), using the definition of a. So, (m+ad)/c is an integer, say b. 
We thus have found integers, a, b, c, and d, such that, in (2), e=a+d 
and m=bc—ad. It remains to show that all the x; are integral. 
Obviously, this will be true if and only if y;=d (mod c) for each i. 
We already know this for i=1, and from (6), if y,=d (mod ¢), 


Yntid = (a + d)d — ad (mod c) = @? 
or, since (c, d) =1, yn4,=d (mod c). 


CoroLiary. For any given sequence {y;} of integers satisfying (2), 
R(yn—Yn+i, Yn) tS constant. 


Proor. Since {y;} is periodic, it does not matter which term we 
consider the first in defining a value of c as in Theorem 8. In particu- 
lar, the largest possible value of ¢c cannot change. 
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5. Parametric solution. Theorem 8 completely elucidates the con- 
nection between sequences satisfying (2) and sequences satisfying (1). 
We shall now give a parametric solution to the problem of finding all 
possible sequences satisfying (2). Evidently this is hardly necessary 
when 7; and rz are real or when e=0, so that by Theorem 6, k = —e?/m 
must assume one of the three values 1, 2, or 3. Introducing an 
integral parameter ¢, we must have, according to the value of &, 
e=tand m= -—+?#’, or e=2t and m= —2??, or e=3t and m= —3??. This 
gives the first part of the following theorem. 


THEOREM 9. If e?+4m <0 and eX0, the transformation (2) must take 
one of three forms: 


Yori = k(t — #/yn), k = 1, 2, or 3, 


where t is an integer. In addition, there must exist relatively 
prime integers, u and v, such that t is divisible by uv(u—v) if k=1, 
by either uv(u—v)(u—2v) or uv(u—v)(u—2v)/4 according as u is 
odd or even if k=2, or by either uv(u—v)(2u—3v)(u—2v)(u—3v) or 
uv(u—v)(2u—3v)(u—2v)(u—3v)/27 according as u is not or is di- 
visible by 3. In all cases y,=ut/v. 


Proor. The proof is much the same for any value of & and will be 
given for k=2. If a sequence {ys} of integers satisfies yn,1=2t 
—2i?/y,, we may certainly find relatively prime integers, u and », 
such that y,=ut/v. Then ye, ys, and y, may be calculated as: 


u—v u — 2v 
t and 2t— 
u u— 20 


2t 


It is easily seen that if the fraction u/v is in its lowest terms so are 
(u—v)/u, (u—2v)/(u—v), and v/(u—2v). Consequently, »v|t, 2¢, 
(u—»)|t, and (u—2v) | 2¢. If u is odd, u, v, wu—v, and u—2v have no 
factor in common since (uw, v)=1. If u is even, (u, u—2v)=2. The 
theorem follows. 

EXAMPLE. Take u=4, v=3. Then # must be divisible by 
4-3-1-(—2)/4=—6. A possible value for ¢ is #=12 which gives 
91 = 16, the transformation being: y,41 = 24 — 288/y,, and the sequence: 
16, 6, —24, 36, 16, ---, 91) = R(10, 16) =5. Thus, to con- 
struct a sequence satisfying (1) we may take c=5, and perhaps— 
the choice is arbitrary—x, = 3; then d=1, a=e—d=23, b=(m+ad)/c 
= —53. The transformation is therefore x,4;= (23x, —53)/(5x,+1), 
and the sequence: 3, 1, —5, 7,3,---. 
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MINIMAL IDENTITIES FOR ALGEBRAS 
A. S. AMITSUR AND J. LEVITZKI 


1. Introduction. Let F be the underlying field of an algebra A and 


X1, X2,°**,%m a set of indeterminates. Consider a polynomial 
f(x1,--+,%m) over F, that is, an element of the free algebra 
F(x1, - + + , Xm) generated by the indeterminates x; over the field F. 
If this polynomial is not identically zero and if the equation 

(1) +++, = 0 


is satisfied by all elements of A, then we say (see [1]") that the 
polynomial identity (1) holds in A. Such identities are satisfied, for 
example, by each algebra of finite dimensionality.? An identity of 
minimal degree will be called a minimal identity, and the correspond- 
ing polynomial a minimal polynomial.’ In a previous communication 
(see [4]) it has been shown by one of the present authors that the de- 
gree of the minimal identities for the total matric-algebra A, of all n 
by matrices is at least 2m. In the present note we show (§2) that 
this degree is exactly 2n, and that we may choose as a polynomial 
identity one having the form 


where the sum on the right ranges over all permutations of 2 letters 
and the sign is positive for even permutations and negative for odd 
permutations. We shall use for (2) occasionally also the shorter nota- 
tion S:,(x). Polynomials of type (2) were previously studied in the 
same context by I. Kaplansky [2] and F. W. Levi [3], and we adopt 
for these polynomials and the corresponding identities the terms 
standard polynomials, respectively standard identities, suggested by 
Kaplansky. 

In §3 we deal with the problem of determining all minimal poly- 
nomials of A,. It turns out that all minimal polynomials are homo- 
geneous and linear in each of the indeterminates. Moreover, if 
%1, X2, °°, Xz is a given set of indeterminates, then minimal poly- 
nomials depending on the x’s only may be constructed if and only if 


Received by the editors May 8, 1949. 
1 Numbers in brackets refer to the bibliography at the end of the paper. 
* For further details compare [1] and [4]. 


3 Minimal polynomials in the ordinary sense will not be referred to in this paper, 
so that no confusion will arise. 
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k22n and the set of all minimal polynomials in the x’s forms (if 
supplemented by zero) a module over the underlying field whose 
dimensionality is Cy,2.. As a basis for this module we may choose 
the C;.2, standard polynomials S(x;,, - - - , xi,,), where (t, - , tn) 
is an arbitrary combination of 2m letters out of k. For k=2n we have 
in particular the following uniqueness theorem: The minimal poly- 
nomial depending on no more than 2 indeterminates is uniquely de- 
termined (up to a numerical factor) as the standard polynomial 
Son(x). The case »S2, F=prime field of characteristic 2 requires a 
special treatment. 

In §4 we apply our results to general algebras of finite dimen- 
sionality. 


2. The main theorem. We begin with a few remarks concerning 
some simple properties of the standard polynomials which will be 
needed in the proof of the main theorem. 

REMARK 1. Since the standard polynomial is homogeneous and 
linear in each of its indeterminates, we have 


S(x1, au + Bo, °° -) 
= aS (x1, Xi, u, Xan) + BS(x1, Xan). 


REMARK 2. The standard polynomial vanishes if two of the inde- 
terminates are identified. 


REMARK 3. If (i, - - - , 4m) is any permutation of m letters, then 

REMARK 4. Denote by (i, - - - , im) any permutation of m letters, 
and by S’ the sum of all terms in S,(x) having in common 
the left factor xi,x;,---x;, where r<m, then S’=+4;,x;, x3, 
° ** Ma)- 

REMARK 5. Denote by a, a2, - - - , dn aset of m matrices of order m, 


such that the rth row and the rth column of each of these matrices 
consists of zeros. Denote by a/ the matrix derived from a; by deleting 
the rth row and the rth column. If S(a, ---, an) =0, then also 
S(ai, - ++, a),)=0 and conversely. This is a consequence of the fact 
that the correspondence a—a’ determines an isomorphism between 
the rings generated by the a; and the a/ respectively. 

REMARK 6. If an algebra satisfies an identity S,,(x)=0, then it 
satisfies also each identity S;(x) =0 with k>m. 


Lemma 1. Jf for an odd positive integer r we put y=Xis1 
and if S’ denotes the sum of all terms of S,»(x) containing the common 
factor y, then 
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(3) = S(x1, Xi, y; Tm). 


Proor. The right member of (3) is a sum of terms with coefficients 
+1 each being a product of m—r+1 factors. If p and g are any two 
such terms, then let 7 denote the permutation transforming / into 
q. If y is replaced by the product x;41 - - - x:,,, then p and g become 
products of m factors each, and the permutation z induces a permuta- 
tion 7’ of these m factors. Relation (3) will be proved if we show that 
these two permutations are of the same type, that is, that 7 and 7’ 
are even or odd simultaneously. It is evidently sufficient to show that 
if w is a transposition, then 7’ is odd. Now this is evident in case 7 
does not involve the factor y, since in this case 7’ is also a transposi- 
tion. Suppose now that w permutes y and x;, and denote by & the 
number of factors separating y from x;. It follows easily that the 
permutation 7’ may be resolved into k(r+1)+, transpositions of 
adjacent factors. Since r is odd, also k(r+1)+r is odd, that is, the 
permutation 7’ is odd, q.e.d. 

We now turn to the total matric-algebra A, and introduce the n? 
matric-units e;;, which are subject to the relations 


0 
and consider sets of 2” matric-units 
(5) Cigien Cian ions 
Our chief concern will be to investigate the quantity 


and to show that this quantity vanishes. It will be convenient to 
introduce a function f(u) defined for 1 Su <n which denotes the num- 
ber of occurrences of u« among the 4m subscripts in (5). Evidently 


(7) 


u=1 


We now prove the following lemma. 


LEMMA 2. Suppose that the identity Son_2(x) =0 holds for any system 
of 2n—2 units in An. Suppose further that at least one of the units (5) 
is idempotent, and that for this unit eu, we have f(u) <4. Then the 
quantity (6) is equal to 0. 


Proor. In case any two of the units (5) are equal, this is true by 
Remark 2, and thus we may assume that all units in (5) are dif- 
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ferent from each other. Since 2 <f(u) $4, we have to consider three 
cases. The case f(u) =2 is trivial, since here the idempotent e¢,,, is the 
only unit with a subscript equal to u, and hence by (4) each term in 
(6) vanishes. In case f(u) =3, we have aside from e,, a unit of the 
form é,s(resp. while the remaining units have no sub- 
scripts equal to u. By (4) it follows that the only terms which are 
possibly not equal to 0 are those with the left (resp. right) factor 
Cuuuk (FeSP. Cku€uu). If therefore we denote the remaining 2”—2 
factors by e™, e@, - - - , e"-®), it follows in view of Remark 4 that 
(6) is equal to (resp. =+S(e™,---, 
e@"—?))¢,,€u,). Since none of the units e has a subscript equal to x, 
it follows by the assumption of the lemma in view of Remark 5 that 
S(e™, - - - , e@*-2)) =0, and hence also (6) vanishes. It remains to 
deal with the case f(u) =4. Here we have exactly three units having 
a subscript equal to wu. If these units are of the form éuu, éuz, éu; or of 
the form éuu, €ku, €ju, then by (4) each term in (6) vanishes, that is, 
(6) is equal to 0. We may therefore assume that the three units with 
the subscript u are of the form éuu, @us, @ju. Denote the remaining 
2n—3 units by e, ---, e@*-®% and define the three partial sums 
S’, S’’, S’"’ of (6) as follows: S’ is the sum of all terms with the left 
factor éyv€.z and the right factor ej; S’’ is the sum of all terms with 
the right factor éj.€u, and the left factor e,.; S’’’ is the sum of all 
terms with the common factor @ju€uueur =e. Since by (4) all other 
terms in (6) vanish, we have 


(8) Cian ian) = S’ + + 


Each nonzero term of S’ is of the form @uueuz@eju = teuu, where a is a 
product of 2n—3 units. To this term uniquely corresponds a term 
with an opposite sign belonging to S’’, namely F €up@éju€uu = F Cuu- 
The signs of the corresponding terms are indeed opposite since the 
number of factors is even (=2m) and hence the cyclic permutation 
which transforms the corresponding terms in one another is odd. 
Hence it follows that S’+.S’’=0. As to the partial sum S’’’ we have 
in view of Remark 3 and Lemma 1 


(9) = S(e e), e(2n—3)) 
Since the subscripts of the 2n—2 units participating in (9) are dif- 


ferent from u, it follows by the assumption of the lemma in view of 
Remark 5 that S’” =0. This completes the proof of the iemma. 


LEMMA 3. Suppose that Sn2(x) =0 is valid for any system of 2n—2 
units in An1. Suppose further that (6) vanishes whenever the number of 
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the idempotents among the units (5) is not less than r+-1, where r=0. 
Then (6) also vanishes whenever the number of idempotents in (5) is 
equal to r. 


ProoF. As in the proof of Lemma 2 we easily dispense with the case 
that any of the units are equal, and in case r>0 may assume without 
loss of generality that the idempotents are ey, é22, ---, érr. If for © 
some subscript uw such that 1 Sur we have f(u) $4, then (6) van- 
ishes by Lemma 2. We may therefore restrict our attention to the 
case that f(u)=5 for u<r and consider the following two cases: 

Case I. There exists in (5) a unit ex such that i>r, k>r. This 
holds, for example, for each unit in case r=0. Consider the matrix 
c=e+e%. This matrix is idempotent of rank 1 and the elements 


(10) €11, °° » Orr, © 


form an orthogonal system of r+1 idempotents,‘ each having the rank 

Case II. For each unit e, and e,; with 1 >r we have k Sr. This implies, 
for example, that r= 1. Since f(1) 25, it follows that for somei such that 
4>r we must have f(z) $3. We now reduce this to the case where we 
may assume that for some 7 we even have f(t) $2. To this end first 
note that it follows by (4) that if some term in (6) is not equal to 0, 
then either each f/*1) is even, or f(u) is even for 2n—2 indices and odd 
for 2 indices. Suppose now that f(u)23 for all u’s and f(z) =3 for 
some 7. Then either each term in (6) vanishes and our lemma is 
proved, or else we are confronted by the following two possibilities: 

(1) f(a) =f(j) =3 for a pair of indices 1+j, and f(u) =4 for u¥z, 7. 

(2) f(t) =3, and f(u) 24 for 

We may assume in the first of these two cases that f(1) is even, and 
since f(1)=5, we must have f(1)26, and thus it follows in both 
cases in view of (7) that we must have r=1. This implies that each 
unit is either of the form ej or é,, which leaves us with at most 
2n—1 units instead of with 2m. This contradiction shows that in Case 
II we may indeed assume that for some 7 we have 


(11) i>r, fi) 


If for this ¢ we have f(z) =0, our lemma is true by Remark 5. Thus 
Case II is reduced to the case where we have a unit ey (or é;) so that 
(11) holds and k Sr. We assume without loss of generality that k=1. By 
putting c=e;+en (respectively and (respec- 
tively d=e,—e:;), we get the following orthogonal set of r+1 


4 In case r=0, this system reduces to the single idempotent c. 
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idempotents of rank 1 
(12) d, €22, * 5 Crr, C. 


In view of the properties of systems (10) and (12) it follows that there 
exists a regular matrix @ with coefficients in F which transforms (10), 
respectively (12), into the system of r+1 idempotents 


(13) C11, * * * Crry r+i- 


In both Cases I and II we may write our units in the form ey, -- - , 


Crr, Ciz, , (in Case II we must eventually replace e;, 
by e:) and we have in view of Remark 3 


If in (14) we replace ey by e;; we shall have r+1 idempotent units 
participating, and hence by the assumption of the lemma 


(15) S(éu, * » Crry e), = 0. 


If further in Case II we replace ey by ei, we shall have two equal 
units, and hence by Remark 2 


From (14) and (15) we obtain by addition 

while in Case II subtraction of (16) from (14) yields 


In Case II our system of 2” units may be written in the form 
(19) C11, * » Crry City 


Since we have chosen e,; so that (11) holds, it follows that if in (19) 
we replace é,; by e;; and é1 by ea, we shall get a system of 2” units with 
an idempotent e;; such that f(t) $4, and hence by Lemma 2 


From (15) and (20) we obtain by subtraction 

(21) S(d, €22, °° + Grr, EO, = 0 

while from (18) and (21) we get by addition 

(22) + Chenin) = S(d, €22,- Cre, EM, , 


Transforming now (17), respectively (22), with the matrix a defined 


= 
r 
| 
t 
oO 
i 
t 
| 
} 


1950] MINIMAL IDENTITIES FOR ALGEBRAS 455 


above, we obtain therefore in Case I as well as in Case II in view of 


the properties of a in relation to the three orthogonal systems (12), 
(15), and (14) 


(23) + (ei; Cis, inn) 
= S(ei, * * Crry a eq, 
Now express each of the matrices a~'ea, j=1, ---, 2n—r—1,asa 


linear form of the n? units e,:, then by Remark 1 we shall be able to 


represent the right member of (23) as a sum of terms each having the 
form 


where a@ is in the underlying field. Since the system of 2” units par- 
ticipating in (24) contains at least r+1 idempotents, it follows by the 
assumption of our lemma that (24) vanishes. Hence the left member 


of (23) is equal to 0, and this in turn shows that also (6) is equal to 0, 
q.e.d. 


Lemma 4. For each n the standard identity S2,(x)=0 is satisfied by 
each set (5) of 2n units. 


Proor. We prove this lemma by induction. For =1 this is trivial. 
Suppose now that 2>1 and that the lemma is true for each positive 
integer less than n, then we have to show that (6) vanishes for an 
arbitrary system (5) of 2” units. Now by Remark 2 the quantity 
(6) vanishes if all 2” units in (5) are idempotent. Hence by repeatedly 
applying Lemma 3 it follows that (6) vanishes no matter how many 


of the units (5)—if any—are idempotent. This completes the proof of 
the lemma. 


THEOREM 1 (THE MAIN THEOREM). For each n the standard identity 
Son(x) =0 holds for the complete matric-algebra 


Proor. Denote by ai, a2, - - - , da, a set of 2” arbitrary matrices 
belonging to A,. We have to show that 
(25) S(a1, den) = 0. 


To this end express each a; as a linear form (with coefficients in the 
underlying field) of the nm? units ex. By Remark 1 we may express 
the left member of (25) as a sum of terms each having the form 
* * * Where @ is in the underlying field. By Lemma 4 
each of these terms vanishes, and hence (25) holds. 


3. Uniqueness. Each homogeneous polynomial identity of degree 
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m which is linear in each of its indeterminates x, - - - , x, may be 
written in the form 


where a) is in F and the sum on the left of (26) ranges over all 
permutations (4) of m letters. Some of the a;;) may of course be equal 
to 0. We prove 


THEOREM 2. [f the total matric-algebra A,, satisfies an identity (26) 
with m S2n, then m=2n and the left member of (26) is but for a numer- 
ical factor belonging to F the standard polynomial Son(x). 


Proor.' In [4] it has been shown that a minimal polynomial of A, 
is at least of degree 27, hence we have m=2n. Let us now compare 
two terms of the left member of (26) which but for the coefficients 
may be transformed in each other by transposing a pair of adjacent 
factors. We may assume without loss of generality that these terms 
are of the form 


In case r is odd, that is, r=2i—1, we perform the substitution 
= Cj = C77 for j < i, 
= Xai = Csi, 
= = Cj-1 5 for j > 1. 


As is readily seen, under this substitution the sum of the two terms 
(27) yields aéin+ Bein =(a2+8)éin, while each other term vanishes. 
Hence we have (a+8)e:1,=0, or 


(28) a+p=0. 
In case r is even, that is, r= 27, we apply the substitution 
= C541 tty = for j <4, 
= = Csi, 
= C7 = for i<j Sn —1, 
= Cnny Xen = 


Under this substitution the sum of the two terms (27) yields ae + fen 
=(a+8)eu, while the only other summands which are possibly not 


5 This theorem was proved independently also by G. Goldring of Jerusalem 
University. 
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equal to 0 are those which correspond to the cyclic permutations of 
the product éy2¢22 - - - @nn€ni- Hence it follows that the left member of 
(26) has the form (a+f)eu+ > fi2aies. Since (26) holds, each 
coefficient of this sum must vanish, and this leads again to (28). 
Consider now an arbitrary term ay)x;, - - - x;,, of the left member of 
(26). Since each permutation (7) may be derived from the identical 
permutation by successive transpositions of adjacent letters, it fol- 
lows in view of (28) that ay = +a, where the sign is positive or nega- 
tive according as the permutation (7) is even or odd. Hence 
Xian = AS2n(x), q.e.d. 

We now consider a polynomial f(x, x2, - - - , X=) whose degree in 


one of its indeterminates, say x;, is equal to r>1. By applying the 
transformation 


Sly + 2, Ha, °° *, Xm) — f(y, °°* Hm) — Sm) 
= g(y, z, , Xm) 


one obtains a polynomial whose general degree is not higher than that 
of f(x1, - - - , Xm) and whose degree in y as well as in 2 is exactly r—1. 
By successive steps of this kind one obtains a polynomial whose gen- 
eral degree is not higher than that of f(x, x2, - - - , Xm) and which is 
linear in each of its indeterminates (see [1, Lemma 2]). A nonlinear 
polynomial will be referred to in short as almost linear if each trans- 
formation of type (29) performed on this polynomial yields either 
zero or a polynomial linear in each of its indeterminates. One obtains 
easily the following lemmas. 


(29) 


Lemma 5. A nonlinear polynomial ts either almost linear or it may be 


transformed into an almost linear polynomial by a finite number of opera- 
tions of type (29). 


LEMMA 6. A nonlinear polynomial f =f(x1, x2, - Xm) ts almost 
linear if and only if it has the form 


(30) fe Df 


where fo is of degree not greater than 1 in each x; while each f;, i>0, ts 
either of degree 2 in x; and of degree not greater than 1 in each xx, ki, 
or identically equal to 0. Finally, f;40 for some +>0. 


We shall need also the following lemma whose proof may also be 
omitted. 


Lemma 7. Suppose that f=fi+f2, where each monomial of fi, resp. fa, 
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is of degree not less than 1, resp. of degree 0 in x:. If identity f=0 
holds in an algebra A, then both identities f,=0, fo=0 hold in A. 


THEOREM 3. Let h(y1, - - - , yx) =0 be a minimal identity for the 
total matric-algebra A, such that k S2n. If either n>2 or the underlying 
field is not the prime field of characteristic 2, then k=2n and h is but for 
a numerical factor the standard polynomial S2,(y). 


Proor. We know by Theorem 1 and by [4] that the degree of h is 
2n. The case where h is linear in each y may be dispensed with® by 
Theorem 2 with the help of Lemma 7. We therefore assume that h is 
not linear. Since operations of type (29) transform polynomial 
identities into polynomial identities, it follows in view of Lemmas 5 
and 6 that our identity may be transformed into an identity (1) 
whose corresponding polynomial f is almost linear, that is, of the form 
(30). By Lemma 7 we may assume that f;, 120, is of degree not less 
than 1 in each x (or f;=0), fo has general degree 2n —1 (or fy =0), and 
fi, #21, is of general degree 2m in case f;~0. Thus it follows that we 
may put m=2n—1. We assume finally without impairing the gen- 
erality of the case that f;~0. Let us apply now the transformation 
(29). Since each f;, 11, is either equal to 0 or linear in x; we get 


gly, 2, %2,°°°, Xan—1) = + 


(31) 


The polynomial thus obtained is evidently homogeneous and linear 
in each of its indeterminates, and its degree is equal to 2”. Hence by 
Theorem 2 


Our theorem will be proved if we arrive at the contradictory conclu- 
sion that a,;=0. To this end consider an arbitrary monomial of f; 
and write it in the form yax;bx,c, where y € F and a, b, ¢ are products 
of certain x’s which are not equal to x;. Under transformation (29) 
this monomial yields —yazbzc = yaybzc 
+-yazbyc, that is, two monomials with the same coefficient y.’? Since 
on the other hand these monomials may be transformed into each 
other by transposing the factors y and 2, it follows in view of (32) 
that they must have coefficients equal to +a; with opposite signs, 


6 It follows that if h is linear in each x, then it must be homogeneous. Otherwise 
we would obtain by Lemma 7 an identity of degree less than 2”, which by [4] is 
impossible. 

7 Apparently neither of these two monomials can be checked against any other 
monomial. 
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that is, 
(33) —-y=y=2t a1 


In case the characteristic of F is not equal to 2, it follows from (33) 
that a,=0, which completes the proof of the theorem in this case. In 
case the characteristic is equal to 2, it follows from (33) that y =a, 
that is, all coefficients of f; are equal. We get a similar result for each 
fi, +21. Denote by p:=pi(m1, - - - , Xen-1) for +21 the sum of all 
(2n)!/2 monomials which have degree equal to 2 in x; and degree 1 
in each x,, k#%, then f;=a;p;, a relation which holds also for f;=0 
(with a;=0). Thus we have 


2n—1 
i=1 
Now apply the following substitution, 
(35) = Ci; #=1,---,m; Sei = Ci #=z1,---,#-—1. 
The only monomial linear in each x; which under substitution (35) 
yields the unit @, is x1%2 - + -Xe,-1. Each other monomial of this kind 
vanishes. Hence fo(eu, é12, - - -)=Béin, where BEF. Since under 
(35) we get x3,=0, x%_,=e%, it follows easily that po(en, - - - ) 


=(Q; ) By (34) we have 0=f(en, ) 
@2;-1€1n, Which implies B+ Q2;_;=0 or, since the char- 
acteristic is equal to 2, 


im 
Now apply the substitution 
(37) = Ci i441, i= 1,---,n— 1; Xai = Ci, i= 
= Cnl- 


The only monomial linear in each x which under (37) yields the unit 


én is again - - and hence it follows that the coefficient of 
éu in the matrix fo(é:2, éx2, - - - ) is equal to 8. Since under (37) we 
have x3,_, =0, x3,=e::, it follows that en- pei(er, - - while 
€11° €22, =O. Hence the coefficient of in the matrix 
f(ér2, is B+ ae: and we have 
(38) B= > ax. 

t=1 


Finally apply the substitution 
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Xi = #=1,---,n-—1; Zn = Onn; 
(39) +1 


Xi = Con—i+1,2n-i; i= + 2n—1 


(thus for n=3: x1=€12, X2=€23, X3=€33, X4=€32, The mono- 
participating in the f’s which yield the unit e,; all other monomials 
yield either zero or units not equal to ey. Hence it follows that the 
coefficient of é in the matrix f(é12, - - - , én) is equal to 8+a,, that is, 


(40) B = an. 
If m is odd, that is, m»=2k—1, we have by (36) and (40) 


Since the odd indices may be arbitrarily rearranged without altering 
the right member of (41), we obtain 


If m is even, that is, m»=2k, we have by (38) and (40) 
n—1 

(43) = ar 
t=1 


which by rearranging the even indices yields 
(44) a= = 


In case n>2 it follows from (42) and (44) by rearranging the indices 
that all coefficients are equal, ajy=a2= - + - =Q2n_1. By adding (36) 
and (38) we have 28= > or ai:=0, which 
is the required contradiction. It remains to deal with the case 
where 32, the field F has characteristic 2, and F contains an 
element 6 such that 60, 641. Since all f;, i#1, are linear in x, 
we have in case m=2 the relation f(5x:, x2, x3) —8f(x:, x3) = 
=a6(5—1)pi(x1, x2, x3), while for »=1 we obtain f(5x:) —6f(x) 
=a,5(5—1)x}. Since 5(5—1)+0, this implies the validity of the 
identity a:pi(x1, x2, x3)=0, respectively axij=0. By substituting 
M1=eu, respectively x1=1 one gets aifi(eu, e12, 
= = = 0, respectively a,1=0, that is, in both cases the 
desired contradiction a; =0. This completes the proof of the theorem. 


THEOREM 4. If either n>2 or F is not the prime field of character- 
istic 2, if further f(x1, x2, +++, Xm) is a@ minimal polynomial of An, 
then m2 2n and f has the form 


| 
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where the sum ranges over all Cy,2, combinations (i) of 2n letters out of 
m, and the a) are in the underlying field. 


Proor. If m<2n, then by Theorem 3 we have m=2n and 
(x1, Xm) =aS(x1, - - - , Xen), in which case (45) is valid. Suppose 
now that m>2n and consider an arbitrary monomial a,x, - - - xx, of 
F, where a0 and each x;, is some x;. We do not assume linearity be- 
forehand, so that some of the factors may coincide. Suppose that 
Xiyy * are the distinct factors participating in this mono- 
mial, and denote by fi, respectively fe, the sum of all monomials 
which depend on those s factors only, respectively the sum of the 
remaining monomials, that is, f=fi+/f2. By Lemma 7 it follows that 
both identities f; =0, f2=0 hold in A,. Since the degree of f is equal to 
2n, we have s Sr S2n, and hence in view of Theorem 3 it follows that 
s=2n and fi(xi,, =arS(xi,, - - , Xig,). We may repeat 
now the same argument with fe, and thus proceeding, after a finite 
number of steps obtain the required representation (45). 

It follows readily that the C,,,2, standard polynomials defined above 
are linearly independent, and since each linear combination of these 


polynomials yields a minimal polynomial (or zero), we have the 
following theorem. 


THEOREM 5. If either n>2 or F is not the prime field of characteristic 
2, and if x1, X2, + +, Xm is a set of indeterminates, then minimal poly- 
nomtals of A, depending on the x’s only exist if and only tf m22n. 
The set of all such polynomials forms (if supplemented by 0) a module 
over whose F dimensionality is equal to Cnn. As a basis for this module 


we may choose the Cy.2, standard polynomials defined in the previous 
theorem. 


The exceptional case where m $2 and F is the prime field of char- 
acteristic 2 is settled by the following theorem. 


THEOREM 6. The condition of Theorems 3, 4, 5 is necessary for the 
validity of these theorems, or in other words: The total matric-algebras 
A; and Az over the prime field of characteristic 2 possess nonlinear 
minimal identities, for example, x+-x*=0, respectively 


(46) ay + yxy? + yay + ye + xy? + = 0. 


Proor. Denote by e the unit of Az. Each element of A: satisfies one 
of the following 4 equations 
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Y=yte. 


By substituting each of these equations into the polynomial on the 
left of (46), one finds that this polynomial vanishes for each x. This 
completes the proof of the theorem, the case of A; being trivial. 
REMARK. It remains to be settled by which processes—if at all— 
each polynomial identity of A, may be derived from the minimal ones. 


4. Algebras. If A is an algebra over F, and B=A XG, where G is 
a field containing F, then evidently each identity which holds in B 
holds also in A. On the other hand, if an identity linear in each of its 
indeterminates holds in A, then this identity holds also in B (see 
[1, Lemma 3]). In view of Theorems 1 and 2 we have therefore the 
following theorem. 


THEOREM 7. If A is a simple algebra of degree n* over its centre F, then 
the degree of a minimal identity satisfied by A is exactly 2n. A minimal 
identity depending on 2n indeterminates only and linear in each of 
these indeterminates x1, X2, +++, Xen ts uniquely determined but for a 
numerical factor as the standard identity S2,(x) =0. 


Consider now a semi-simple algebra A and suppose that A is the 
direct sum of the ideals A’ and A’’, each of which satisfies an identity 
(1). Then identity (1) holds also in A. Indeed, let e’ and e’’ be the 
units of A’, resp. A’’ so that e=e’+e’’ is the unit of A, and consider a 
set of m elements a), a2, - , in A. We have then f(a, da, - , Gm) 
=f(e’a,,---, ---, €’"dm). Since (1) holds in A’ as 
well as in A’’, each of these summands vanishes, and we have in fact 
f(a, - - - , @m) =0. This leads to the following theorem. 


THEOREM 8. Let A be a semi-simple algebra and A’, A’’,---, A® 
its simple constituents. Denote by nj the order of A‘ over its centre. If 
n*=max (ni, - - - , nz), then the degree of any minimal polynomial of A 
ts exactly 2n. If a minimal polynomial depends on 2n indeterminates 
only and is linear® in each of these indeterminates x1, -- - , Xan, then 
this polynomial is uniquely determined but for a numerical factor as the 
standard polynomial S2n(x). 


For an arbitrary algebra of finite dimensionality we have the fol- 
lowing theorem. 


THEOREM 9. Let r be the index of the radical N of an algebra B, and 
n?=max (nj, +--+, mz) where the n? are the orders of the simple con- 


5 As will be shown in a following communication Theorems 3-6 also retain their 
validity for general semi-simple algebras. 
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stituents of the difference algebra A—N. Then B satisfies the following 
identity of degree 2nk. 
[S(a1, +--+, an) ]* = 0. 


Proor. By Theorem 8 the difference algebra A—WN satisfies the 


identity S2,(x) =0. If bi, - - - , be, is a set of 2m elements belonging to 
B, we evidently have the relation S(hi, ---,be,)€EN; hence, 
[S(bi, - - , ben) 

Remark. If 7=1,---, 20; j7=1,---,k, is a set of 2nk in- 


determinates, then evidently also the identity 


j=1 


holds in B. This identity is homogeneous and linear in each of its 
indeterminates. 
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JERUSALEM UNIVERSITY 


ON MATRICES WHOSE CHARACTERISTIC EQUATIONS 
ARE IDENTICAL 


W. V. PARKER 


In a previous paper [1]! it was shown that if A and C are matrices 
such that ACA=0 and B is an arbitrary matrix, then AB and 
A(B+C) have the same characteristic equation. It is the purpose of 
this note to prove two theorems, each of which gives the above re- 
sult as a special case. 


THEOREM 1. Let A be an nXm matrix of rank r<n and let C be 
an mXn matrix such that ACA=kA (k a scalar). If B is an mXn 
matrix, the characteristic equation of AB is x*-"$(x) =0 and the char- 
acteristic equation of A(B+-C) is x**$(x—k) =0. 


Let P and Q be nonsingular matrices such that 


=> => an = 
0 0 B; B, C3 C4 


where B, and C, are square matrices of order r. Then since ACA =kA, 


and hence C,=&lI,. 


Also 
0 Ci C2 RI, C2 
0 OJ \Cs Ca, 0 0 
and 
I, O\/Bi B B, B 
0 O/\B; B 0 O 
and hence 


P(AB + AC)P> = 


0 0 


Presented to the Society, September 2, 1949; received by the editors February 16, 


1949 and, in revised form, May 19, 1949. 
’ Numbers in brackets refer to the references cited at the end of the paper. 
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It follows that the characteristic equation of AB is x*~"¢(x) =0 and 
the characteristic equation of A(B+C) is x*-*¢(x—k)=0 where 
(x) =0 is the characteristic equation of B,. The result of the previous 
paper follows if k=0. 

In case k~0 it may as well be assumed that k=1s0 that ACA =A 
Set U=AB, E=AC and V=U+E. Then E?=E, EU=U, and EV 
=V. Hence V?U=(U+1)*U and U*V=(V—1)*V so that f(V)U 
=f(U+1)U and f(U) V=f(V—1) V for all polynomials f(x). Let g(x) 
be the minimum function of U and h(x) the minimum function of V. 
Then g(V—1)V=g(U)V=0 and A(U+1)U=h(V)U=0 so that 
h(x) | xg(x —1) and g(x) | xh(x+1). Hence the minimum functions of 
U and V satisfy one of the four relations: 


(1) h(x) = g(x — 1); (2) 1)A(x) = xg(x — 1); 
(3) (% — 1)h(x) = g(x — 1); (4) h(x) = xg(x — 1). 


That all four relations are actually possible may be shown by ex- 
amples. 


THEOREM 2. If M and N are square matrices such that NM (or 
MN) = N*=0, then M and M+N have the same characteristic equation. 


Since N?=0, JN is similar to 


0, I, 0 
0, 
00 0 


where ¢ is the rank of N. There is no loss of generality in assuming 
that N is in this form. Then since NM=0 


M, M; 
M= (0 0, O 

M, M; M; 
where M, and M; are square matrices of order r. Any principal minor 
of M which contains elements of M:2 also contains a row of zeros from 
rows r+ 1 to 2r. Hence the characteristic equation of M is independent 
of the elements of M2. But Mand M+N differ only in the elements of 
M; and therefore their characteristic equations are identical. If 
MN=WN?=0, then N’M’=N’2=0, so that M’ and M’+N’ have the 
same characteristic equation. But the characteristic equation of any 


matrix is the same as that of its transposed matrix and the proof of 
the theorem is complete. 
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If ACA =0 then (AC)(AB) =(AC)*?=0 and hence AB and A(B+C) 
have the same characteristic equation. 

It is well known that if A and B are square matrices, AB and BA 
have the same characteristic equation and if either A or B is non- 
singular, the two products are similar. Roth [2] has pointed out that 
if both A and B are singular, the products may be similar or not. 

With P and Q as defined in Theorem 1 


B, B; By 0 0 
PABP+ ( )=-( )+ ( )- M+ N, 
0 0 0 


where Ni and 


where MN, = N3=0. 
Hence 
AB = P'N,P=K+S5S 
where SA =S*=0 and 
BA = 
where AT=T?=0. Since K=P"*MP=AB-—-S and L=QMQ" 
=BA-—T the following theorem is established. 


THEOREM 3. If A and B are square matrices, there exist matrices S 
and T such that SA = S*=AT=T?=0 and such that AB—S is similar 
to BA—T. 
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SIMPLY CONNECTED HOMOGENEOUS SPACES 
DEANE MONTGOMERY 


Introduction. Let G be a topological group which acts transitively, 
as a transformation group, on a space M. If x is any point of M, let 
G, be the closed subgroup made up of all elements in G which leave 
x fixed. There is a natural one-to-one correspondence between points 
of M and points of the coset space or homogeneous space G/G,. This 
one-to-one correspondence is not a homeomorphism in the most 
general cases, but it is clearly a homeomorphism if G is a compact 
Lie group, and it is also a homeomorphism if G is a Lie group and M 


is a manifold [1]. In the present note the following theorem will be 
proved: 


THEOREM A. If G is a connected Lie group which acts transitively 
on a compact manifold M, and if G, ts connected, then G contains a 
compact subgroup which acts transitively on M. 


Because of the facts stated in the introduction this theorem could 
be given an equivalent formulation in the slightly different language 
of homogeneous spaces, but this will be omitted. When M is simply 


connected, G, must be connected so that the theorem implies the 
following: 


CorROoLiary 1. If G is a connected Lie group which acts transitively 
on a compact simply connected manifold M, then G contains a compact 
subgroup which also acts transitively on M. 


PROOF OF THE THEOREM. The proof depends on a theorem about 
Lie groups which was proved in essence by Cartan, Chevalley, and 
Malcev which will now be stated. 

If G ts a connected Lie group, then G ts the direct product, as a space, 
of a compact subgroup K and a euclidean space E. 

For a proof and references to original sources see [2]. It is true that 
maximal compact subgroups of G exist which contain any given com- 
pact subgroup, that any two such are conjugate, and that K may be 
chosen as one of these maximal compact groups. 

In view of this theorem let the group G of Theorem A be the direct 
product, as a space, of K and E 


G=K-E. 


Received by the editors May 21, 1949. 
1 Numbers in brackets refer to the bibliography at the end of the paper. 
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Since G, is connected, G, is the direct product of a compact sub- 
group L and a euclidean space F 


= L-F. 


It may be assumed that L C K. 
Let f; be the map from G/L to G/K under which each coset of L 
is carried to the coset of K which contains it. 


fi: G/L—-G/K. 


This is a fiber mapping in the sense, for example, that each point of 
G/K is in a neighborhood whose inverse is a product of a fiber and 
a cross section. Each fiber is homeomorphic to K/L. But G/K is 
homeomorphic to E, a euclidean space of some dimension, and when a 
fibering has such a base space, it is a product. Therefore G/L is 
homeomorphic to the topological product of K/L and E, which is 
written as 


(1) G/L = K/L X E. 


Let f2 be the map from G/L to G/G,= M under which each coset of 
L is carried to the coset of G, in which it is contained. 


fe: G/L—G/G.z. 


This is also a fiber mapping and each of the fibers is homeomorphic to 

G,/L, that is, to F, a euclidean space. But when all the fibers are 

homeomorphic to euclidean space it is known that a cross section A 

exists. The set A is a closed set in G/L, touching each fiber of fe 

precisely once, and A is homeomorphic to the base space G/G,= M. 
There exists a mapping P which retracts G/L onto A 


P: G/L—-A, 


that is, P is defined on all of G/L to A and each point A is fixed under 
P. It follows that if a cycle in A bounds in G/L, it also bounds in A. 
The set A is a manifold and hence contains a cycle 2, at least mod 2, 
such that 
dim z = dim A 

and z does not bound in A. Therefore z does not bound in G/L. 

From (1) it follows that the homology properties of G/L are the 
same as those of K/L, and therefore dim M=dim A=dim 2Sdim 
K/L. 

Since L is a maximal compact group in G,, it follows that 


L= K(\Gz. 
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Then K(x), the orbit of x under K, is homeomorphic to K/L so that 
dim M 2 dim K(x) = dim K/L 2 dim M. 
Since K(x) is a manifold, 
K(x) = M 
and this completes the proof of the theorem. 


CoROLLARY 2. If G is a connected Lie group which acts transitively 
on a compact manifold M, and if G, has a finite number of components, 
then G contains a compact subgroup which also acts transitively on M. 


Coro.iary 3. If G is a connected Lie group which acts transitively 
on a compact manifold M, where M has a finite fundamental group, then 
G has a compact subgroup which also acts transitively on M. 


Under the hypothesis of Corollaries 2 and 3, G, has at most a finite 
number of components. Let G} be the identity component of G,. 
Since G/G? is a finite covering of G/G, and G/G, is compact, it fol- 
lows that G/G? is compact. By Theorem A, G contains a compact 
group K such that 


But then 


which proves Corollary 2. 

As a matter of fact it is not necessary to assume in Theorem A that 
G is connected, for if G has a countable number of components and is 
transitive on M, then the identity component of G is also transitive 
on M. The same remark applies to the corollaries. 

The Klein bottle is a homogeneous space of a connected Lie group 
(Mostow) but not of a compact Lie group, which shows that some 
such extra hypothesis as the one used here is necessary. 
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ROOTS AND FACTORS OF ORDINALS 
PHILIP W. CARRUTH 


1. Introduction. An ordinal X is termed a root of an ordinal A of 
index B if X®=A. In §2 of this paper we employ the Cantor normal 
form! of an ordinal and make extensive use of a result of Sherman’s, 
[3, Theorem 2.1], to determine necessary and sufficient conditions for 
one ordinal to be a root of another. By means of these conditions it 
is shown that the set of roots of an ordinal is a closed set (that is, it 
contains the least upper bound of each subset of it), and thus that two 
ordinals either have no root in common or a greatest common root. 
Also an immediate consequence of these conditions is the known re- 
sult [1, p. 191, (13), Theorem } that an ordinal has an infinite number 
of roots if and only if it is indecomposable.? We also give necessary 
and sufficient conditions for an ordinal to have a greatest root less 
than itself. 

In §3, we use results of Jacobsthal [1 and 2] to prove a theorem of 
which the fact that the set of left factors of an ordinal is closed 
(proved by other methods in [3] and [6]) and our result in §2 that 
the set of roots of an ordinal is closed are simple corollaries. Al- 
though the proof in this section is shorter than that in §2, the method 
used in §2 has the advantage of giving additional information con- 
cerning the roots of an ordinal, and it further demonstrates the use- 
fulness of the Cantor normal form as a tool in investigations of this 
sort. 

§4 is concerned with the decomposition of an ordinal into the prod- 
uct of prime factors. An ordinal is prime if it is not the product of two 
ordinals less than itself. It is known [1, p. 183, (17), Theorem] that 
an ordinal is prime if and only if it is one of the following: a prime 
integer, one greater than a transfinite indecomposable ordinal, an 
indecomposable power of w. An ordinal can be written as the product 
of a finite number of prime factors, but this representation is not 
always unique. Sieczka [4] and Jacobsthal [1, p. 184, Theorem] 


Presented to the Society, October 30, 1948; received by the editors October 21, 
1948 and, in revised form, May 28, 1949. 

1See [5] for information concerning the Cantor normal form and arithmetic 
operations with ordinals. Henceforth in this paper, whenever an ordinal is written as 
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showed that every ordinal has a unique prime factorization provided 
that certain additional conditions are satisfied by the factorization. 
We give some alternative additional conditions that insure the 
uniqueness of the prime factorization. The prime factorization satis- 


fying these conditions has the possible advantage of using the mini- 
mum number of factors. 


2. Powers and roots of ordinals. Throughout this section we let 
A= > a; >0, a(0) = ec; or 0, X= > 1, 
and B=o+n21, where g is a limit ordinal or 0 and z is a non-nega- 
tive integer. 

We shall define three types of ordinals A. The definitions will be so 
made that an ordinal will have a root less than itself if and only if 
it is of at least one of the three types. The division of such ordinals 
into types is strictly formal and is only necessary so that the state- 
ments of theorems concerning conditions for one ordinal to be a root 
of another will not become overly involved. 

A is of type I if: 

1. s=0. 

2. do is an integral power, greater than the first, of an integer 
greater than 1, and/or a(0)>0. 

A is of type II if s<0 and if there exist an integer 7, OS jr, anda 
factor p; of c; such that: 

1. j7>0 and/or p;>¢;. 

It is noted that the integer j described for type II is not unique; 
namely, let A = w*?+?+w*?t!, Then j could be either 0 (in which case 
p; could be taken to be 1) or 1 (in which case p; could be taken to be 
2). 

A is of type III if s>0, and if there exist integers 7, OSj<r, and 
m and m such that: 

1. j7>0 and/or m<s. 

2. mn=s. 

3. a(km) = we; n+ for OSkSn—-1 
(¢j,.0=¢;). 

4. c;,./(n—k) =); is an integer for OSk Sn—1. 

5. a(s) = 

6. If m>1, —a(km)+a([k—1]m+i) = —a(hm) +a([h—1]m+14) 
for 1Sk, and1sism-1. 


3 Throughout this paper, whenever the upper limit of summation for the Cantor 
normal form of an ordinal is less than the lower limit, it is to be understood that the 
ordinal is 0. 
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7. If n>1, for 1SkSn—1. 
8. If m>1, for OSRk, hSon—1 and 1<i<m-—1. 
is an example of an ordinal of type III. In this case 7=1, m=2, and 
n=2. 

Condition (5) for type III implies that the integer 7 described for 
that type is unique. However, m and m are not uniquely determined; 
namely let A = w*+w*+w?+w+1. Then j=0 and both m and n could 
be 2 or m could be 1 and could be 4. 

It is noted that A could be of both types II and III; namely, let 
A =" tet14e*te, On the other hand, A =w*t!+w®* is of type III 
but not of type Il; A =w‘+q* is of type II but not of type III. 


THEOREM 1. Let B be greater than 1, and let X® =A. Then A is of 
at least one of the types 1, II, and III. Specifically: 

1. If m=0 and/or B=c, then A is of type 1.4 

2. If m>0, n>0, and B(m)>0, then A is of type II. 

3. If m>0, n>0, and B(m) =0, then A is of type III. 


PRooF OF (1). (1) is easily verified if X and B are integers. If X is 
an integer and B2w, then X°t*=(X*)*X"=oX", where wu=c. 
(It is easily verified that w is a left factor of every limit ordinal.) 

If X2w, m=0, and n>0, then (w®bo)? = wy is of type I. 

If and then ( = is of type I. 

Proor oF (2). In the proof of (2) and in further proofs we shall 
make use of [3, Theorem 2.1] which states that a necessary and suffi- 
cient condition that an ordinal Y be a left factor of an ordinal 
D= wd; is that either 0< Y¥<w™ or Y=o%%e;+ 
w*®d;, where e; is a factor of d; and 0SjSk. 


t=0 


Hence a(0) =8(0)B. Since 10, it follows from [3, Theorem 2.1] 
that there exist an integer 7, OSj<r, and a factor p; of c; such that 
B(0) = p;+ %c;. The proof of [3, Theorem 2.1] implies 
that B= > Since B>1, 7>0 and/or p;<c;. 
The fact that a(s) =8(0)(B—1)+(s) implies that a(s) >8(0)(B—1). 
Therefore condition (2) for type II is satisfied. 

PROOF OF (3). 


4 See also [1, p. 190, Corollary 1]. 
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XB = ABH, 4+ 4... 
(B-1)48(m-B 
+ FO + 4... 
+ PO C+D + +... 


& 
= Fora, 
t=0 


It is apparent that mn=s. Since B(0)B=a(0) and n¥0, [3, Theorem 
2.1] implies that there are an integer j, OSj<r, and a factor p; of c; 
such that B(0) It can be verified now that 
B= > e;-+-n. Since B>1, 7>0 and/or m<s. 
B(0)(B—k) =a(km) for OSkSn. Therefore a(km)= wre; 


41=0 
p+ wc; and c;4=p;(n—k) for OSkSn—1. 
B(0)o =a(s),= > w%c;, as can be verified by direct computa- 
tion. 


Since a([k—1]m+i) =a(km) +8(4) for 1Sk Sn and 1<is<m-—1, if 
m > 1, it is seen that condition (6) for type III is satisfied. Conditions 
(7) and (8) are easily verified to be satisfied. 


THEOREM 2. Let A be of type 1. Then X is a root of A tf and only if 
one of the following conditions is satisfied: 

1. X*=do, where n is a non-negative integer, and X < w. 

2. 0<6(0) <w™, tf ao=1. 

3. B(0) >0, m=0, bo =ao, B(0) p;+ where p; ts 
a factor of c; and OSjSr. 

If (1) is satisfied, the index B= If (2) is satisfied, 
B= >i-9 w-**\c;, where r is the degree of B(0) (that is, the greatest 
ordinal that appears as an exponent of w in the Cantor normal form of 
B(0)). If (3) is satisfied, B= 


Proor. Assume that (1), (2), or (3) is satisfied. Then using the 
corresponding value given for B, one can verify by direct computa- 
tion that X?=A. 

Now assume that X?=4 and that B+1. (If B=1, (1) or (3) is seen 
to hold.) The proof of (1) of Theorem 1 implies that if X is an integer, 
(1) must be satisfied. 

If 8(0) >0, then the proof of (1) of Theorem 1 implies that either 
ao=1 and £(0) is a left factor of a(0) leaving a quotient that is a 
limit ordinal or that m=0, do=bo, and B(0) is a left factor of a(0) 
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leaving a quotient that is not a limit ordinal. In the first case, [3, 
Theorem 2.1] implies that 0<8(0) <w””. In the second case, this 
theorem implies that 6(0) w%c;, where p; is a 
factor of c; and OSjSr. 


THEOREM 3. Let A be of type II but not of type 111. Then X is a root 
of A if and only if X =A or the following conditions are satisfied: 

1. m=s, b;=a; for OSiSs. 

2. B(O) =w where j and p; satisfy the require- 
ments for the integers j and p; described for type 11. 

3. B(Rk) = —B(0)( > 32) +10 +a(k) for 0<kSm. 

If (1), (2), and (3) are satisfied, then the index B= > {=}, w-V'P+1e, 
+¢;/p;. 


Proor. Assume that (1), (2), and (3) are all satisfied. Then using 
the value given for B, one can verify by direct computation that 
X8=A. 

Now assume that X=4A and that B¥1. Then by Theorem 1, X is 
a limit ordinal and m>0. By the proof of (2) of Theorem 1, it is seen 
that (1) is satisfied and that 8(0) is a left factor of a(0) leaving a 
quotient that is not a limit ordinal. Hence [3, Theorem 2.1] implies 
that =w1p;+ where p; is a factor of c; and 
0Sj<r, and that B has the value as stated in our theorem. Since 
B>1,j>0 and/or p;<c;. The proof of (2) of Theorem 1 also implies 
that 6(0)(B—1)+ =a(4) for 0<ism. Thus (2) and (3) of our 
theorem are seen to be satisfied. 


THEOREM 4. Let A be of type III but not of type Il. Then X is a 
root of A if and only if X=A or the following conditions are satisfied: 

1. s=mn, where m satisfies the requirements for the integer m de- 
scribed for type III. 

2. B(0) =w™p;+ wc;, where j is the unique inieger satis- 
fying the requirements for the integer j described for type III. 

3. pj=C;,4/(n—k) for OS kSn—-1. 

4. If m>1, B(t)= —a(km) +a([k —1]m+i) for 1SkSn and 1S7 
=<m-—1. 

5. B(m) =0. 

6. bo =d0, bn =a,, and if n>1, bobm=Aim for 1SkSn—1. 

7. If m>1, for OSkSn—1 and 1Sism-—t. 

If X satisfies these seven conditions, the index B= b Ro: w-WAt1WW¢; 


+n. 


Proor. Assume that the conditions (1) through (7) are satisfied. 
Using the value given for B, one can compute X¥. The normal form 


| 
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for X* will be that stated in the proof of (3) of Theorem 1. (2) and (3) 
of this theorem together imply that 6(0)(B—k) =a(km) for OSkSn. 
This fact together with (4) implies that if m>1, 8(0)(B—k)+8(4) 
=a([k—1]m+i) for 1SkSn and 1SiSm-—1. (6) and (7) now are 
sufficient to show that X?=4A. 

Now assume that X?=A and that B¥1. Then by Theorem 1, 
m>0O, B(m) =0, and B is not a limit ordinal. From the proof of (3) 
of Theorem 1 it is seen that s=mn, and 8(0)B=a(0). Hence [3, 
Theorem 2.1] implies that B(0) =" p;+ where p; 
is a factor of c; and O<j<r, and that B= 
Since B>1, j7>0, and/or m<s. Since B(0)(B—k) =a(km) for 0Sk 
<n-—1, by direct computation it is observed that a(km) = > 47) wc; 
+e; 4+ wc; for these values of k, where c;,.=p;(n—k). 
Since 8(0)(B—n)=a(s), direct computation shows that 7 is the 
unique integer described for type IIT. 

The proof of (3) of Theorem 1 implies that if m>1, B(t) = —a(km) 
+a([k—1]m+i) for 1SkSn and 1<tSm-—1, and also that (6) and 
(7) must hold. These facts together imply that m satisfies the re- 
quirements of the integer m described for type III. 


THEOREM 5. Let A be of both types II and I11. Then X is a root of A 
if and only if one of the following three conditions is satisfied: 

1. X=A. 

2. X satisfies conditions (1), (2), and (3) of Theorem 3. 

3. X satisfies conditions (1) through (7) of Theorem 4. 


THEOREM 6. An ordinal has a root less than itself if and only tf it is 
of at least one of the types 1, II, and III. 


ProorF. This is an immediate consequence of Theorems 1 through 5. 


THEOREM 7. The set of roots of an ordinal is closed. 


ProorF. Let S be the set of roots of A. If A is of type I and a(0) =0, 
then S consists of a finite number of integers. 

If A is of type I, a(0) >0, ao>1, then S consists of a finite number 
of integers and of a finite number of ordinals that are greater than or 
equal to w, since in ) 4-9 wc; there is only a finite number of 
coefficients, and each coefficient has only a finite set of factors. 

If A is of type I, a(0) >0, and ao=1, then S consists of all ordinals 
greater than 1 and less than or equal to w raised to the power w? 
and of at most a finite number of ordinals greater than w raised to the 
power 

If A is of type II or of type III, S consists of a finite number of 


) 
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ordinals greater than w. 


CoroLiary 1. The set of roots of an ordinal is infinite if and only if 
the ordinal is indecomposable.® 


ProoF. This is an immediate consequence of the proof of Theorem 
7 and the fact that the set of indecomposable ordinals is just the set of 
powers of w. 


CoROLLARY 2. A has a greatest root less than itself if and only if it is 
of types I, Il, or III and is not equal to an indecomposable power of w. 


Proor. This follows immediately from Theorem 6 and the proof 
of Theorem 7. 


THEOREM 8. Two ordinals have no common root or a greatest common 
root. 


ProorF. This follows directly from Theorem 7, since the intersection 
(if not void) of the two sets of roots of two given ordinals will contain 


its least upper bound, which will be therefore the greatest common 
root of the two ordinals. 


3. Ordinal functions. We begin this section by giving a few defini- 
tions and results from [1] and [2]. Following Jacobsthal [1], we let 
Bo be a fixed ordinal and we let correspond to each ordinal a=8o an 
ordinal w(a) having the property that if a>a’=fo, then w(a) 
>w(a’). Now let & and 70 be two fixed ordinals. We let correspond 
to each pair of ordinals £2, 72>mo0 an ordinal g(£, 7) having the 
property that g(t, »)>£. Jacobsthal proved [1, Theorem II] that 
corresponding to given functions w(a) and g(£, 7) there is a fixed 
ordinal \ and a function f(a, 8) defined for a2, B21 with the 
properties: 

1. f(a, 1) =w(a). 

2. fla, B-+1) 6), a). 

3. f(a, B) =limg f(a, 8), if B is a limit ordinal and 8 on the right side 
of equation (3) runs through all ordinals 8 <8 =lim 8. 

The function g is called the stem function of f. A function defined as 
f is called a C-function. Throughout this section, w(a) will denote 
a function defined as is the function w(a) of [1, Theorem II], f will 
denote the corresponding C-function, and g will denote the stem 
function of f. Whenever the symbol f(a, 8) is used, it will be assumed 
that B21 and that a2X, where X is the ordinal \ described in [1, 
Theorem II]. It also will be assumed that g satisfies the following 


® See also [1, p. 191, (13), Theorem]. 
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condition: 


Conpition A. If §>£’>£ and then g(é, n)>g(t’, 7’) 
and 9’) n’). 


THEOREM 9. Let f(a, 1) =a, and let g=f, be a C-function satisfying 
the condition f,(fi(a, 8), vy) =fila, fi(B, y)). Then for fixed +, the set of 
all solutions a of the equation f(a, B) =¥ is either vacuous or closed. 


ProoF. Let Z be the set of all solutions a of the equation f(a, 8) 
=¥, and let Z; be any subset of Z. We assume that Z; is not vacuous. 
Let 6 be the smallest ordinal that is greater than or equal to every 
ordinal in Z;. Suppose that 4 is not in Z. 

Let o be the smallest ordinal satisfying the equation f(a, ¢) =7, 
for a in Z;. Let r be the smallest ordinal in Z; satisfying the equation 
f(t, =¥. 

[1, Theorem III] states that if B>§’, then f(a, 8)>f(a, 8’) and 
therefore that if f(a, 8)>f(a, B’), then B>8’. [1, Theorem VII] 
states that if aZza’, then f(a, B)=f(a’, B), and if f(a, B)=f(a’, B) 
for a>a’, then £ is a limit ordinal. Consequently it is seen that for 
a>r in Z,, f(a, ¢)=~¥. Since according to our supposition there is 
no greatest element in Z,, [1, Theorem VII] also implies that o is 
a limit ordinal. 

Jacobsthal proved in [2, Theorem VIII] that under the hypotheses 
of our theorem, if ¢ is a fixed limit ordinal, then the set of all ordinals 
a satisfying the equation f(a, ¢) =y has a limit a, and the equation 
f(a, o1)=~7 has a solution o;. According to our supposition a,+6. 
Hence 5<a,, and there exist two ordinals, a’ and a’’, such that 
a’ <i<a’’ and such that f(a’, ¢) =f(a’’, ¢) =y. But then it follows 
from [1, Theorem VII] that f(5, ¢) =. This gives us a contradiction 
to our supposition. 

The following corollary has been proved elsewhere by other meth- 
ods [6, Theorem] and [3, Corollary 2.4]. 


CoroLuary 1. The set of left factors of an ordinal is closed. 
ProoF. Let g(a, 8) =f;(a, 8) =a+8, and let f(a, 8) 
CoROLLARY 2. The set of roots of an ordinal is closed. 


ProoF. Let g(a, 8) =fi(a, 8) =a8, and let f(a, 8B) 
We prove another result of which [6, Lemma] is a corollary. 


THEOREM 10. Let f satisfy the condition g(f(a, 8), f(a, m)) =f(a, B+n) 
for finite n. Let o be a fixed ordinal and r a fixed limit ordinal. If 
p ts a solution of the equation g(5, p)=f(o, r) and if 5<f(e, T), then 
p2f(e, w). 
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Proor. [1, Theorem VI] states that if 8 is given and if @ is an 
ordinal such that w(a) $8, then there is a unique ordinal £ such that 
f(a, E+1)>B2f(a, &). This implies that since r is a limit ordinal, 
there exists an ordinal such that f(c, >8. Hence f(c, 7) 
<g(f(c, u), p) by Condition A. 

Suppose that p<f(¢, w). Then [1, Theorem VI] implies that there 
is an integer m such that p<f(c, m). Therefore f(¢, r)<g(f(c, yu), 
n)) =f(¢, by hypothesis and Condition A. Since p+n<r 
[1, Theorem III] implies that this is a contradiction. 


CoROLLARY 1. Let f(a, 1) =a, and let g satisfy the two conditions, 
g(g(a, B), v) =g(a, g(B, v)) and gla, lim B) =limg g(a, 8). Let o bea 
fixed ordinal and r a fixed limit ordinal. If p is a solution of the equation 
g(5, p) =f(c, r) and if 5<f(c, 7), then p=f(o, w). 


Proor. This is an immediate consequence of [2, Theorem VI], 
which states that under the hypotheses of the corollary, it follows 


that g(f(a, B), f(a, y)) =f(a, 


COROLLARY 2. Let p be a remainder greater than 0 of or, where tr is a 
limit ordinal. Then 


CoROLLARY 3. Let p be a right factor greater than 1 of 0°, where r is a 
limit ordinal. Then p2=o*. 


4. Prime factorization. We revert to the notation of §2. We also 
let 7(b) stand for the number of prime factors in the unique prime 
factorization of the integer b>1. We let r(1) =0. 


THEOREM 11. An ordinal can be decomposed uniquely into a product 
of a finite number of prime factors if and only if it is not a limit ordinal. 
If A is not a limit ordinal, the number of prime factors in the unique 
decomposition of A is s+ 


ProoF. If A is a limit ordinal, then A =wR=2wR, and so the de- 
composition can not be unique. 

Now assume that a(s) =0 and that s>0. If a,=1, [3, Theorem 2.1] 
and [1, p. 183, (17), Theorem] imply that the only prime left factor 
of A is w**-)+1. In this case the quotient is te(%q,; 
+4,-1. 

If a,>1, [3, Theorem 2.1] implies that the only prime left factors 
of A are the prime integral factors of a,. If p is a prime factor of a,, 
then A =p( 

Since the quotient in each case again is not a limit ordinal and since 
prime factorization of integers is unique, a simple induction argument 
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shows that the prime factorization of A is unique. It is easily seen 
that the number of prime factors is s+ } 4.» r(a,). 


LemmMA. Let an ordinal greater than 1 be indecomposable and not 
prime. Then it can be decomposed uniquely into the product, P,P2, of 
two prime factors if the following additional condition is imposed: 
QP:<PiP:2 for every prime ordinal Q that is less than P,. 


ProoFr. Let A be greater than 1, indecomposable, and not prime. 
Then the proof of [3, Theorem 2.1] implies that A can be decom- 
posed into the product, Q,Q2, of two prime factors only if Q2 is equal 
to w raised to the power w™”. Let Q; =o or o +1. Then Q,Q2 is equal 
to w raised to the power un+w”™. Q,0.=A if and only if pt+ta™ 
= wc;; that is, where 6= (c,—1), 
and 0S7<w”. Hence our lemma is proved. In fact, P2 is equal to 
w raised to the power w”™”, and P,=w' if 6 is indecomposable, and 
P,='+1 if 5 is decomposable. 


THEOREM 12. A can be factored uniquely into the product, P\P2---, 
of a finite number of prime ordinals if the following conditions are 
imposed: 

1. The number, r(A), of prime factors is the minimum into the prod- 
uct of which A can be decomposed. 

2. If P2ts a limit ordinal, then QP2<P,P2 for every prime ordinal Q 
that is less than P,. 

If A is not a limit ordinal, r(A)=s+ ) 3-5 r(a;). If a(s) és greater 
than 0 and indecomposable, r(A)=s+1+ > io r(a,). If a(s) is de- 
composable, =s+2+ r(a,). 


ProoF. If A is not a limit ordinal, the results follow from Theorem 
11 and its proof. 

Now let a(s) be greater than 0. By [3, Theorem 2.1], the only 
possible prime left factors of A are those prime ordinals less than or 
equal to w*). If Q; is a prime left factor of A and if A=Q,B, then 
B= eis wq;, where B(s) 20. Hence in order to factor A into the 
product of a finite number of prime factors, A must be factored as 
Qi: On where o(s) =0 and is a positive integer, and 
Qi ---Q,=w*. To factor A into the product of the minimum 
number of prime factors, must be made a minimum. 

If a(s) is indecomposable, n will equal 1 if and only if we take Q, 
equal to Then Since, by 
Theorem 11, C has a unique prime factorization, the factorization of 
A satisfying the required conditions is unique, and r(A)=s+1 


+ Yi-or(ai). 
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Now let a(s) be decomposable. Then w* is not prime, and so 
n>i1. Our lemma and Theorem 11 imply that A can be factored 
uniquely into the product, P,P: ---, of s+2+ r(a;) prime 
factors, where P; is a limit ordinal, with the condition that QP: be 
less than P,P; for every prime ordinal Q that is less than P. 


COROLLARY. A can be factored uniquely into the product of a finite 
number of prime ordinals with the single restriction that the number of 
prime factors be the minimum into the product of which A can be de- 
composed if and only if one of the following conditions is satisfied: 

1. a(s) =0; that is, A ts not a limit ordinal. 

2. a(s) ts indecomposable; that is, A=DE, where D is an inde- 
composable power of w, and E is not a limit ordinal. 

3. a(s) ts one greater than a decomposable ordinal; that is A= DwE, 
where D is a decomposable power of w, and E is not a limit ordinal. 


ProoF. This follows from Theorem 11 and the proof of our lemma 
and Theorem 12. In the case where a(s) is decomposable, the proof 
of the lemma implies that w*®) can be factored uniquely into the 
product of two prime factors with no further restriction, if and only if 
= (w+ 1)w=at!, where p is decomposable. 
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ON THE DECOMPOSITION OF ORTHOGONALITIES 
INTO SYMMETRIES 


PETER SCHERK 


1. Let § be a field of characteristic # 2, and let R, denote 
the space of all column vectors over § with 2 components. In the 
following, Greek letters denote elements of §, while small italics 
[#m, n,r, s| stand for vectors in ®,, and m-rowed squared matrices 
over § are denoted by capital letters. A prime indicates transposition. 

Let G be a fixed regular symmetric matrix. Thus 


G=G, |G|#0. 

Two vectors a and 3 are called perpendicular if a’Gb=0. Two sub- 
spaces R* and R** are perpendicular if x«’Gy =0 for all x CR*, yCR**. 
Obviously, these relations are symmetric. The vectors perpendicular 
to a given vector respectively to a given m-space form an (m—1)- 
space, respectively (7 —m)-space. 

We call the matrix T orthogonal if it leaves the expression x’Gy un- 
changed for all x and y. This condition is equivalent to 


(1) T'GT =G. 
If in addition 
rank (T — I) = 1, 


T is called a symmetry (cf. Lemma 2; J=unit matrix). 

Cartan proved that every orthogonality can be decomposed into a 
product of x or less than symmetries. A proof of his theorem can be 
found in Dieudonné’s book.! 

The purpose of this note is to show that the minimum number of 
symmetries into which an orthogonality T can be decomposed is in 
general equal to 


m = rank (T — J). 


An exception occurs if and only if G(T —J) is skew-symmetric. In 
that case, this minimum number is equal to m+2. For a detailed 
description of this case cf. the last part of this note. 


2. Lemma 1. The following three sets of properties of a matrix A are 
equivalent: 


Presented to the Society, September 2, 1949; received by the editors June 14, 1949. 


1 Dieudonné, Sur les groupes classiques, Actualités Scientifiques et Industrielles 
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(a) rank (A—I)=1, A?=T; 
(b) 1 ts an (n—1)-fold, —1 a simple eigenvalue of A; 
(c) There are two vectors a and b such that 


(2) A=I1+ ab’ 
and 
(3) b’a = — 2. 


Proor. Obviously 
rank (A — J) = 1 
(4) is an (m — 1)-fold eigenvalue of A 
(2) holds for suitable a ~ 0, 6 ¥ 0. 


(a)—(b): The first part of (b) implies that A has exactly one other 
eigenvalue a, and this eigenvalue is simple. From Ax =ax it follows 
by means of (a) that 


x=Ix= A*x = A-ax=a:Ax = 
Since and a= —1. 
(b)—(c): From our assumptions, there exists an x ~0 so that 
= (A + = (21 + ab’)x = 2x + 
Thus @ and x are linearly dependent. We may choose x=a and 


obtain (2+6’a)a=0 and therefore (3). 
(c)—(a): From (3) 


A? = (I + ab’)(I + ab’) = I + 2ab’ + a(b’a)b’ = I + 2ab’ — 2ad’= I. 
We call the vector a isotropic if a’Ga=0. 


LEMMA 2. The following three sets of properties of a matrix A are 
equivalent: 


(a) A is orthogonal, rank (A —I)=1; 
(b) There exists a non-isotropic vector a such that 
2aa’'G 


5) A=I[- 
a'Ga 


(c) A maps some non-tsotropic vector a on —a and every vector per- 
pendicular to a on ttself. 


Proor. We first observe that (5) is equivalent to the combined 
three statements (2), (3), and 
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(6) Ga and 6 are linearly dependent. 
(a)—>(b): By means of (4) we obtain (2). Thus, A being orthogonal, 


we have 
0 = A'GA —G = (I + ba’)G(I + ab’) —G 

or 
(7) = b-a/G(I + ab’) + Ga-d’. 
This formula implies (6). Substituting Ga=Xd into (7), we obtain 

= + ab’) + Abb’ = + 
hence 

(2 + b’a)-bb’ = 0. 

G being regular, \+0. Thus (3) is also satisfied. 

(c)—>(b): The assumptions (b) of Lemma 1 hold. This implies (2) 
and (3). From (2), Ax =x is equivalent to b’x =0, and from our as- 
sumptions, it is also equivalent to a’Gx=0. Thus (6) also holds. 

Obviously, (a) and (c) follow from (b). 

We had defined symmetries as matrices possessing the properties 


(a). Thus Lemma 2 gives us two alternate definitions. From Lemma 1, 
we obtain the following 


CoroLuary. If A is a symmetry, then 


(8) A? = I, 
LEMMA 3. 

(9) rank (AB — I) S rank (A — I) + rank (B— I). 
Proor. Put 


r = rank (A — J), s = rank (B — J). 


The vectors x with Ax =x, respectively Bx =x, form an (n—r)-space, 
respectively (m—s)-space. If x lies in the intersection of these two 
subspaces, then A Bx = Ax =x. Thus this intersection lies in the eigen- 
space of AB belonging to the eigenvalue 1. The dimension of this 
eigenspace is therefore greater than or equal to that of this inter- 
section. Hence it is greater than or equal to n—r—s. This implies (9). 

If we apply (9) repeatedly to a product T of m symmetries, we get 


rank (T — J) Sm. 


Thus we obtain the following 
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COROLLARY. An orthogonality T cannot be the product of less than 
rank (T—I) symmetries. 


3. Lemma 4. Let 


(10) rank S > 1, 

(11) S+ S’ £0. 

Then, there exists a vector b such that 

(12) b'Sb 0 

and 

(13) S+S'# (Sb-b'S + S’b-b'S’). 


b’Sb 
Proor. We have 
(14) + S'’)x = + = + (x’Sx)’ = 22’Sx. 


Thus, on account of (11), there are vectors 5 satisfying (12). We may 
assume that at least one vector }; exists that is a solution of both (12) 
and 


1 
15 S+ = —— (Sb-b’S + S’b-b’S’). 
(15) + ys ( + ) 
Put 
(16) c= Si, d = S’b,, a= by 


Then from (12) 
(17) c #0, d #0, a= bic = db{d #0, 
and from (15) 


1 
(18) S+S’ = —(cd’ + dc’). 
Thus, (14) and (18) imply 
1 1 
= — x/(S + S’)x = —c'x-d'x. 
2 a 


In particular, the quadric 
Q = = 0 


is identical with the pair of [not necessarily different ] (m —1)-spaces 


& 
| 
| 


1950] DECOMPOSITION OF ORTHOGONALITIES INTO SYMMETRIES 485 


c'x=0 and d’x=0. 

Now let } be any solution of both (12) and (15). Then Sb and S’bare 
different from zero, and comparing (15) with (18), we see that either 
Sb isa multiple of c and S’b is one of d, or Sb isa multiple of d, while 
S’b is one of c. 

We first consider the case that a solution 5, of (12) and (15) exists 
such that Sb, is not a multiple of c. Then Sb, is a multiple of d, and 
the vectors c and d are linearly independent. Replacing }. by a suit- 
able multiple, we may assume 


(19) Sb. = d, S’be = Be, B # 0. 
Substituting b=), into (15), we obtain 
= (dc’ + cd’), 
bz 
and therefore, on account of (18), 
bz She = af. 


Thus (19) implies 
(20) bid = af, bic=a. 
The vectors 
S(t h)=ctd 


are multiples neither of c nor of d [cf. (16) and (19) ]. Hence the two 
vectors b,+52 cannot solve the system (12), (15). Now, from (16), 
(17), (19), (20) 


(b: + b2)’S(bi + b2) — (01 — 52)’S(b1 — 52) 
= Sbe + = 2(b{d + bic) = 2(a+ a) = 40 0. 


Hence, at least one of the two vectors 5; + be satisfies (12). As it can- 
not satisfy (15), it is a solution of both (12) and (13). 

Suppose now that every solution b of (12) and (15) is mapped by 
Son a multiple of c. The set of all vectors x for which Sx is a multiple 
of c form a subspace ®, of R,. From (10), R, is a proper subspace of 
R,. Hence its dimension is not greater than n—1. It suffices to show 
that there are vectors in ,, that belong neither to R, nor to the 
quadric Q. 

We had 6; CR. — Q. Since Q was a pair of (m —1)-spaces, there exists 
a vector b,.CQ—R-.. Thus, the straight line 


(21) b = (1 — db + Ade 


— 
= 
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lies neither in R, nor in Q. Hence it has exactly one point in common 
with ®. and at most two points with the pair © of (m—1)-spaces. 
Altogether, the straight line (21) meets the union .+Q in not more 
than three points. If § is not the prime field §, of three elements, this 
line contains more than three points. In particular, it contains points 
outside of R.+Q. 

If § =%:, then an (n—1)-space contains 3"~' points. Since ®, has a 
dimension not greater than nm—1 and since ®, and O have the origin 
in common, the set #.+0 contains less than 3.3"-! points, thus 
fewer points than the whole ®,. Hence, there are points outside of 


R.+0.? 


LEMMA 5. Suppose T is orthogonal, G(T —I) is not skew-symmetric, 
and 


m = rank (T — I) > 1. 


Then there exists an orthogonality U such that 
(a) T is the product of U by a symmetry, 
(b) rank (U—I)=m-1, 

(c) G(U—T) ts not skew-symmetric. 


Proor. Put 
T =T-I, S 
We rewrite the orthogonality definition (1) in terms of T) and S: 
T'GT —G = (T+ 1) -G=0 
or 
(22) + S+S’ = 0. 
G being regular, we have 
rank S = rank Ty) = m > 1. 


Thus S satisfies the assumptions of Lemma 4, and there is a vector } 
such that 


(23) b'Sb #0 


and 


2 The proof of Lemma 4 can be simplified considerably if the case } =¥; is ex- 
cluded. If § is the real field, this lemma is trivial. The matrix Sb-b’S+S’b-b’S, 
—b’Sb(S+5S’) depends continuously on b. If it vanishes outside of the quadric 
identically, then it will also vanish on OQ. Hence Sb-b’S+.S’b- b’S’ =0 for all bCQ. 
Therefore, © would be contained in the pair of at most (7 —2)-dimensional subspaces 
Sb =0 and S’b =0, which is impossible. 


i 
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1 
24 S + S' —— (Sb-U'S + S’b-b'S’). 
(24) rsp + ) 


Let R,»-» be the eigenspace belonging to the eigenvalue 1 of T. 
From the regularity of G and of T=7T)+J] 


(25) C Rum Tox = =00S'x =0 (22)]. 
In particular, from (23), 
(26) b 

Define 
(27) a = T»b. 
Then, from (25), 
(28) a’Gx = b'S’x = 0 for all x C Ma_m. 
Furthermore, (27), (22), and (14) yield 

a’'Ga = W'T{GTob = — + S’)b = — 20'S. 
Thus, from (27) and (23) 
| (29) a’'Ga = — 20'Sb = — 20'Ga 0. 

We now put 

(30) 
a'Ga 


Thus, A is a symmetry [cf. Lemma 2]; and with A and T, U is an 
orthogonality. From (8) 


T = A*T = AU. 


We now verify that U also has the required properties (b) and (c). 
If xCR»_m, then on account of (28) 


2a’Gx 


a'Ga 


2%. 


Furthermore, we obtain from (30), (27), and (29) 
2a’Gb 


a'Ga a’'Ga 


(T — A)b = (7.4 


hence 
Ub = ATb = A% = b. 


ee vere 
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Thus the eigenspace of U belonging to the eigenvalue 1 contains 
both R,-» and b. As b¢Rz—m [cf. (26) ], its dimension is not less than 


n—m-+1, that is, 
rank (U — I) Sm-—1. 


Thus our assertion (b) follows by applying Lemma 3 to T=AU. 
It remains to be shown that G(U—TJ) is not skew-symmetric. We 
have 


2aa’G 


a'Ga 
T bb’ Té G(T. + 1) T obb’S 
—20'Sb 
[cf. (27), (29), and (22)]. Thus 
Sbb'S 
GU =s —-—— 
b’Sb 


and (c) follows from (24). 


THEOREM 1. Suppose T is orthogonal and G(T —I) is not skew-sym- 
metric. Then T can be written as a product of m=rank (IT —I) sym- 
metries, but not of less than m. 


Proor. For m=0 and m=1 our statement is trivial [cf. Lemma 2]. 
Suppose it is proved up to m—121. From the corollary of Lemma 3, 
T cannot be a product of fewer than m symmetries. Thus, our 
theorem follows from Lemma 5 and our induction assumption. 


4. From now on we assume not only that T is orthogonal but also 
that G(T is skew-symmetric. Put 7—J, m=rank Being 
the rank of the skew-symmetric matrix G7», m is even. As the case 
m=O is trivial, we may assume m22. 

From our assumptions 


(31) GT, + TiG = 0. 
Hence 
T'GT —G = (Tg + NG(T. + I) —G = TiGT, + GT, + TUG = 0 
implies 
(32) = 0. 
We obtain from (31) and (32) GT7?+7)’GT,=GT¢=0. Since G is 


| | 

| 
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regular, it follows that 


(33) T= 0. 


Our definition of m implies that T) maps the whole space #, on 
an m-space §,,, and the set of the vectors x with T7yx=0 forms an 
(n—m)-space R»-m. From (33) 

(34) Rm C 


therefore 
or msn/2. 


If x<CRn, yCRa-m, then x =T oz for some z, and Tyy=0. From (31) 
= = — = 0. 


So, the two spaces ,, and R,-» are perpendicular. Since the vectors 
perpendicular to ,_,, form an m-space containing R,, this m-space 
is equal to R,,. Hence, a vector is perpendicular to R,-» if and only if 
it lies in R,,. We obtain from (34) that &,, is perpendicular to itself. 


In particular, every vector in ®,, is isotropic. 
Let 


2aa’G 


a'Ga 


(35) A=I- 


be an arbitrary symmetry. Thus a may be any non-isotropic vector. 
Since it cannot lie in ®,,, it is not perpendicular to R,z-m, and the 


(n—1)-space ®,-1 perpendicular to @ does not contain R,-». The 
intersection 


of these two spaces is therefore an (w—m—1)-space. 
Let U=AT. We first show that R,_m-: is the eigenspace of U be- 
longing to the eigenvalue 1. This implies in particular that 


(37) rank (U — I) = m+ 1. 


If xCR»-m—1, then from (36) and (35) Ux =ATx=Ax=x. 


Conversely, suppose Ux=x. Then Tx=A?Tx=AUx=Ax and 
hence 


2a'Gx 


a'Ga 


(38) Tox = a. 


| 
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From (38) and (32) ( 


2a’Gx\? 
= ( ) eo = 0. 
a’'Ga 


Since a’Ga #0, this implies a’Gx =0 or xC®,-1. Going back to (38), 
we obtain Tox =0 or xCRa_m. Thus x CRa_m—1. This proves the above 
statement. 

Since m is even, 


rank G(U — J) = rank (U — I) = m+1 


is odd. Hence, G(U—J) cannot be skew-symmetric. 

From Theorem 1, U is a product of m+1 symmetries. Hence 
T=AU can be written as a product of m+2 symmetries. Suppose 
we have expressed T as a product of k symmetries. Then we may put 
T=AU where A is a symmetry (35) and U is the product of k—1 
symmetries. Since U=AT, we arrive again at (37). From the corol- 
lary to Lemma 3, 


k—-12m+1, thatis, k=>m-+ 2. 
So we have the following theorem. 


THEOREM 2. Suppose T is orthogonal and G(T—I) is skew-sym- 


metric. Let 
m = rank (T — 1). 
Then 
(39) m=0O(mod2) and n/2, 


and T can be decomposed into a product of m+2 but not fewer than 
m+2 symmetries. 


In order to find these transformations T7#J, we choose a basis 
whose m>0 first vectors span R,,. If m<n/2, the next m —2m vectors 
of this basis shall lie in ®,_,,. Since 


“Gy = yGx =0 for all x C Ru, y C 
G has in these coordinates the form 
0 O G 
n 0 Gi\. n 
(40) G=|0 G@ * it G=( ) if 
* 2 


Here G,; and G; are regular square matrices; G; is m-rowed, G: is 
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(n —2m)-rowed and symmetric. We have 


Tox CR, forall «CR, and Toy=0 forall yC Ram. 


To = 
0 0 


where 7; is an m-rowed squared matrix. Its regularity follows from 


rank T)>=m. We obtain 
0 0 
GT» = ( ). 
GiT; 


Thus (32) and (33) are satisfied. Finally, GT) is skew-symmetric if 
and only if the same holds true of G,7;. This leads to the following 
construction: 

Choose 2 =4 arbitrarily, m>0 according to (39) and then G accord- 


ing to (40). Then 
To = 
0 0 


where 72 may be any m-rowed regular skew-symmetric matrix. 
If we take, for example, n=4, m=2, 


1 0 
G, = ( ) and Te 
0 1 


Hence 


ll 
| 

© 


we may put 


0 0 1 0) 00 Oo 1)3 
00 

G= | and = 
10 0o0 00 00 
0 1 0 0} 00 


UNIVERSITY OF SASKATCHEWAN 


3 Professor Coxeter has made the following comment on the last example: “A 
space with two space-like and two time-like dimensions admits a transformation 
leaving a whole plane invariant although it is not merely a rotation. The explanation 
is, of course, that this invariant plane is isotropic.” 
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ON THE POLYNOMIAL SOLUTIONS OF THE GENERALIZED 
LAME DIFFERENTIAL EQUATION 


MORRIS MARDEN 


1. Introduction. The generalized Lamé differential equation with 
which we shall be concerned is 


(1.1) P(z)w” + Q(z)w’ + R(z)w = 0, 


where P(z), Q(z), and R(z) are real polynomials of degrees p, at most 
p—1, and at most p—2 respectively. We shall assume that P(z) has 
p distinct zeros a;=b;+ic; of which the first g¢, 0<qSp/2, lie in the 
upper half-plane, the next g are respectively the conjugate imaginaries 
of the first g, and the remaining p—2g (if any) are real. That is, 


for iSjZq, 
= — Cia for gq<j 2q, 

and, if 2q¢<p, 
c;=0 for 2g <j Sp. 


We assume further that Q(z) has no factor in common with P(z) and 
that in the development 


(1.2) Qz) 


the coefficients satisfy the inequalities 
A;>0, |a;| fori <j p. 


According to Heine (see Marden [1, §9], or Marden [2]),! there 
exist some choices V(z) of the polynomial R(z) corresponding to each 
of which equation (1.1) has a polynomial solution S(z). The S(z) were 
called Stieltjes polynomials by Van Vleck in honor of the man who 
first studied the location of their zeros relative to those of P(z). For 
a similar reason, we shall call the V(z) Van Vleck polynomials. 

In a previous paper (Marden [2]) we obtained the following result: 

For each j=1, 2, - - - , q let there be constructed an ellipse e(a;) with 
the points a; and 4; as foci and with cos a; as eccentricity. Let K be the 
smallest convex region which encloses both all the ellipses e(a;), j 
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=1, 2,---, and all the real points a;, 7=2q+1, 2g+2,---, p2 
Then all the zeros of every Stieltjes polynomial S(z) and all the zeros of 
the corresponding Van Vleck polynomial V(z) lie in K. 

This result is analogous to the theorem of Lucas (see Marden [1, 
§6]) that all the zeros of the derivative of a polynomial f(z) lie in the 
smallest convex region containing all the zeros of f(z). 

In the present paper on the zeros of the S(z) and V(z), we propose 
to develop some results which will be analogous to the following 
theorem of Jensen (see Marden [1, §7]). 

Every non-real zero of the derjvative of a real polynomial f(z) les in or 
on at least one of the circles (called the Jensen circles of f(z)) which has 
as diameter the line segment joining a pair of conjugate imaginary zeros 
of f(2). 

We propose to develop these results by a combination of the 
methods introduced in Walsh [1] and Marden [2 and 3]. 


2. Two lemmas. Let us denote by m the degree of S(z), by & the 
number of pairs of conjugate imaginary zeros of S(z), by 2;=x;+7y; 
(j=1, 2,---,) the zeros of S(z) in the upper half-plane, and, if 
k<n/2, by 2; (j7=2k+1, 2k+2, - - - , m) the real zeros of S(z). Thus, 


S(z) = (2 — 21)(2 — 21) - - - (@ — an) — — (3 — Zn). 


Also, let us associate with each pair a;, d; (j=1, 2,---,q) an 
ellipse E(j, k) defined as follows. Its center will be at the point 7; 
on the real axis such that angle d;, a;, 7; is equal to a;; that is, 


(2.1) 7; = 6; + c; tan aj. 
Its minor axis will be parallel to the y-axis and be of length m; 
=2|a;—7;| , and its major axis will be of length ('/?m;). 

We shall now state two lemmas concerning these ellipses. 


Lemna 1. The envelope of all the circles whose diameters are the vertical 
chords of the ellipse E(j, k) is the ellipse E(j, k+-1). 


As this lemma may be established by elementary calculus, we shall 
proceed to the next lemma. 


LEMMA 2. If a non-real zero 2 of S(z) lies exterior to all the circles 


E(j, 1), 7=1, 2,---, 4, then it les inside at least one of the Jensen 
circles J(zu) of S(z), u=2,3,---,k. 


ProoF. According to equation (9,3) in Marden [1], the zero 2, not 
being a point a;, satisfies the equation 


2 If pS2q, reference to real points a; should be omitted. 


— 
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Qa; 2 


= 0; 
j=l 210; 2; 
that is, 
a k n 
(2.2) Vi+ DXi+ DY V5 4+Z=0 
j=1 j=2q+1 j=2 j=2k+1 
where 


U; = Ajle*i(z, — + — 

X; = 2[( — + — 24, 

Y; = — 2;)"', 

Z = — 


| 


Let us compute the sign of the imaginary part of each term in 
equation (2.2). From the equation 


2i Im U; = — — — 
= — — a) 
+ — — a) 
it follows that 
sg Im U;= —sg (y: Re — 
= —sg { y:[(a1—b;) +91 tan a;| cos a;} 
=—sg {y:[(1—7,) sec 
From the definitions of V; and Z, it follows at once that 
sg ImV;= —sgyi and sgImZ = — gg yu. 


To obtain Im X;, we need only in U; replace a; by 2; and A,e** by 
unity. Similarly, to obtain Im Y; we need only in V; replace a; by 3;. 


Thus, 


sg Im X; 
sg Im Y; 


2 2 2 
— sg — x) + 91 — 
— Sg 71. 


If, now, we suppose the zero 2 of S(z) to lie outside both all the 
circles E(j, 1), j=1, 2,---,9, and all the Jensen circles J(z;) of 


| 

| 

| 
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S(z), 7=2, 3, ---+,, then the imaginary part of each term on the 
left side of equation (2.2) would have the same sign, namely sg (— +). 
That is, the left side of equation (2.2) would not vanish. Hence, if 
all the U; have the sign sg(—y,), at least one X; must have the op- 
posite sign. That is, if 2, is outside all the circles E(j, 1), it lies inside 
at least one circle J(z;). 

We remark in passing that Lemma 2 may be modified as follows so 
as to admit the possibility of both positive and negative A; in equa- 
tion (1.2). 


LemMA 2’. If a non-real zero 2, of S(z) lies simultaneously exterior 
to all the circles E(j, 1) corresponding to positive A; and interior to all 
the circles E(j, 1) corresponding to negative A;, then it lies inside at 
least one Jensen circle J(Zu) of S(z), u=2,3,---,k. 


However, as Lemma 2’ is not suited to iteration, we shall revert to 


our original hypothesis on the A; and in the following sections we 
shall use only Lemma 2. 


3. The zeros of the Stieltjes polynomials. By applying Lemmas 1 
and 2, we shall now construct a proof of the following theorem, which, 
when a;=0 for all j, reduces to a result in Walsh [1]. 


THEOREM I. Let S(z) be any Stieltjes polynomial which possesses k 
pairs of conjugate imaginary zeros. Then each non-real zero of S(z) lies 
in or on at least one of the ellipses E(j, k), j7=1,2,---,4. 


Let us suppose that the non-real zero 2; of S(z) lies outside all the 
circles E(j, 1). Then, by Lemma 2 it lies inside at least one Jensen 
circle of S(z), say J(z2), the circle which has as diameter the line- 
segment joining z: and Ze. If also 22 lies outside all the circles E(j, 1), 
then it lies in at least one of the remaining Jensen circles of S(z), say 
J(z3). If we continue this process, the number of remaining Jensen 
circles diminishes. Eventually for some u, uk, though 2,_, lies out- 
side all the circles E(j, 1) and hence say in J(z,), the point 2, lies in- 
side at least one circle E(j, 1), say E(1, 1). 

Now, applying Lemma 1, we infer that 2,_,; lies in E(1, 2); conse- 
quently that 2,_2 lies in E(1, 3), and so on, and finally that 2 lies in 
x). 

Since ellipse E(1, u) for «Sk is contained in the ellipse E(1, e), 
we have completed the proof of Theorem 1. 

Concerning the derivatives of S(z), we may state the following 
theorem. 


THEOREM II. Let S“(z) denote the rth derivative of the Stieltjes 
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polynomial defined in Theorem 1. Then every non-real zero of S“(z) 
lies in or on at least one of the ellipses E(j,k+-r),j7=1,2,--+-,4¢. 


To prove Theorem II, let us denote by 2” the zeros of S®(z). 
According to Jensen’s Theorem, each non-real zero 2” lies in at least 
one Jensen circle of S*—-(z), say J(2¥~”). The zero lies in at 
least one Jensen circle of S*-®(z), say J(z’~”), and so on. The zero 
2, must lie in at least one Jensen circle of S(z), say J(z:). Since, accord- 
ing to Theorem I, % lies in an ellipse E(j, k), say E(1, 2), we learn 
again by repeated use of Lemma 1 that 2%” lies in the ellipse E(1, k+7). 

Theorems I and II presuppose our knowledge of the number of 
pairs of conjugate imaginary zeros possessed by S(z). If, however, 
we know only the degree m of S(z), we may find the following im- 
mediate corollary of these theorems more useful. 


CorRoLiary 1. Let S(z) be a Stieltjes polynomial of degree n. Then 
every non-real zero of S(z) lies in or on at least one of the ellipses 
E(j, n/2) and every non-real zero of S“(z) lies in at least one of the 
ellipses E(j, (2r+-n)/2), 7=1, 2, +--+, If nis odd, these ellipses may 
be replaced by the smaller ellipses E(j, (n—1)/2) and E(j, (2r-+-n—1)/2) 
respectively. 


4. The zeros of the Van Vleck polynomials. We shall now prove the 
following theorem. 


THeEoremM III. Let V(z) be the Van Vleck polynomial which cor- 
responds to the Stieltjes polynomial S(z) of Theorem 1. Then every non- 
real zero t; of V(z) lies im or on at least one ellipse E(j, k+2), j 
=1,2,---,¢. 


Let us consider the following three cases. 

A. Point 4, lies in or on some circle E(j, 1). 

B. It is one of the zeros 2{, 2/, ---, 2,~-; of S’(z). 

C. It is neither a point 2/, nor a point in or on a circle E(j, 1). 

The theorem is obviously true in case A. In case B, according to 
Theorem II, & lies in some ellipse E(j, k+1) and hence in some 
ellipse E(j, +2). In case C, we shall use the fact that ¢; satisfies equa- 
tion (9,5) of Marden [1]; namely, 


A n—-1 1 
(4.1) > —+ — = 0, 
jai — Gy ju — 


The left side of (4.1) is comprised of terms of the same form as the 
U;, V;, X;, and Y; of equation (2.2). As in Lemma 2, we conclude 
therefore that 4, must lie in at least one of the Jensen circles of S’(z), 


| 
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say J(2/:). By Theorem II, z/ must lie in at least one of the ellipses 
E(j, k+1) and hence by Lemma 1 point #, must lie in at least one of 
the ellipses E(j, k+2). This completes the proof of Theorem III. 

If we now employ the same reasoning as for Theorem II, we may 
infer the following result concerning the derivatives of V(z). 


THEOREM IV. Let V“(z) denote the rth derivative of the polynomial 
V(z) of Theorem I11. Then every non-real zero of V“ (2) lies in or on at 
least one of the ellipses E(j, R+-2+1r), 7=1, 2,-+-,4. 


Finally, we may infer at once from Theorems III and IV the follow- 
ing complement to Corollary 1. 


CoROLLARY 2. Let V(z) be the Van Vleck polynomial corresponding 
to a Stieltjes polynomial of degree n. Then every non-real zero of V(z) 
lies in or on at least one of the ellipses E(j, (n+-4)/2) and every non- 
real zero of V“(z) lies in or on at least one of the ellipses, E(j, (2r+-n 
+4)/2), j7=1, 2,---,¢. If n ts odd, these ellipses may be replaced by 
the smaller ellipses E(j, (n+-3)/2) and E(j, (2r+-n+3)/2) respectively. 
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ON THE ALGEBRAIC INTEGRALS OF A SYSTEM OF 
DIFFERENTIAL EQUATIONS OF MECHANICS 


J. A. BREVES FILHO 


Introduction. If in the system of differential equations 


dx, dx, 
%1%3 + Ys, > X2X%3 — PY3, 
1 
(ey1 — 42), 
dy; dy2 
= — X3¥1, = — 
1 


the parameters X, p are replaced by a, 8, the resulting system will be 
denoted by (1; a, 8). The present paper will present a study of the 
algebraic integrals of the system (1; A, 1). 

The equations (1; A, 1) are, excepting for a change of variables, 
those of the motion of a solid about a fix point, in a uniform field of 
force, when the ellipsoid of inertia relative to the fix point is of 
revolution and the baricenter of the solid belongs to the equatorial 
plane of the same ellipsoid. 

Theorems 3 and 4 are original. The demonstration of Theorem 3, 
presented by R. Liouville [1],' accepted by P. Burgatti [2], is incom- 
plete. In fact, it is implicitly admitted that one of the polynomials 
belonging to the rational integral possesses terms independent of 
¥1. Y2, Ys. The method of demonstration of Theorem 4 was inspired 
from a paper of Husson [3]. 

For the sake of simplicity of nomenclature x will be used instead 
of (x1, x2, x3) and y instead of (y1, ye, ys) wherever possible. 

Let us, “ab initio,” make the following two observations about the 
system (1; A, 1): 

(a) The differential equations (1; A, 1) partake of the following 
particular property of homogeneity: they are not altered when the 
x variables are multiplied by a constant k, the y variables by k?; 
and t, by k=. 


Received by the editors June 20, 1949. 
' Numbers in brackets refer to references cited at the end of the paper. 
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We shall say that a function f(x, y) is homogeneous (in the sense 
alluded to) if f(kx, k*y) =k™f(x, y). 

The scalar m will be called the degree of homogeneity of the 
function. 

If the function f(x, y) is homogeneous of the degree m, its deriva- 
tive df/dt, as considered from (1; A, 1) as a function of x, y, will be 
of the degree m+1. 

In fact 

df(kx, k*y) 


= 


d(k-t) dt 
(b) The system (1; A, 1) is not altered when a permutation of 
*1, X2 is made as well as of 41, y2 so long as —¢ is substituted for +4#, 
or, when —<x3, —yz3 are substituted for +x3, +3 as well as —¢ for 


+t. 


General case (1; A, 1). For the sake of completeness the following 
theorem is included. 


THEOREM 1. Every algebraic integral of the system (1; , 1), 
$(x, ¥), 


is an algebraic combination of rational integrals. 


In fact, if a function ¢ is algebraic, it will be a root of an equation 
(2) + Aig? '+---+Apr6 + A,= 


whose coefficients A; are rational functions of x, y. Without any re- 
striction in generality we may suppose that this equation is irre- 
ducible, as otherwise we would take one of its irreducible com- 
ponents instead of (2). 
Since ¢ is an integral of (1; A, 1), we have d¢/dt =0, and therefore, 
by deriving (2) we have 
dA, dAy-1 dA, 
= 


0, 


the coefficients dA ;/dt being rational functions of x, y. 

Equation (2) being, by hypothesis, irreducible, equation (3), of a 
lower degree, cannot subsist. 

We must, therefore, have 


| 
\ 
aA; 
— = 0, (i= 1,2,---, p). 
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These relations show that the functions A; are (rational) integrals 
of system (1;A, 1). Owing to (2), ¢ is an algebraic combination of these 
integrals. 


THEOREM 2. Every rational integral of system (1; X, 1), 
R(x, y) 
S(x, y) 
ts a combination of homogeneous and rational integrals. 


Let us, initially, consider the developments 
R(kx, k?y) 

= H(x, y, k)[Lo(x, y)k? + Li(x, + --- + 
= H(x, y, k)L, 

(4) S(kx, 
= y, k)[Mo(x, + Mi(x, +--+ + M(x, y)] 
= H(z, y, k)M, 

where Lo, Li, - - -, M, are homogeneous polynomials and H(x, y, k) 


is the H.C.F. of R(kx, ky) and S(kx, k*y). According to the observa- 
tion (a) of the introduction, 
R(kx, k?y) L 


S(kx, M 


is still an integral of the system (1; A, 1), and therefore 


that is, 
aL d 
—— <= = 0, 
dt dt 
whence, 
dM, 
) 
dt at 
dL, dLo dM, dM, 
dt dt dt di 


The constant k being arbitrary, the following relations are ob- 


| 


k) 


ob- 
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tained, 
dLo 
Mo = 0 = 0, 
dt dt 
5 dL aL dM dM 
(8) Mo 0 ~ = 0, 
dt dt dt dt 


which constitute a system of p+q+1 homogeneous linear equations 


on the derivatives of the +q+2 polynomials Lo, Li, -- +, M,. The 
matrix of the coefficients of this system is 


|| Mo 0 ---Q0 
|| Ma My--- 0 


| o O---M, 


L and M being prime, their (Sylvester) resultant is not identically 
null. Thus the rank of this matrix must be p+gq+1 and the system 


(5) will have one degree of indetermination. Under these conditions 
it necessarily follows from (5) that 


aL; 
= al; 
(6) G= 1,2 1,2 ) 
dM; P33 qg 
= al; 
dt 


a being a suitable factor of proportionality. In fact, system (5) is 
identically verified for the values (6). 


From the above relations (6), 


are (homogeneous) integrals of system (1; A, 1). From (4) we con- 
clude that R/S is a rational combination of these integrals. 


THEOREM 3. Every homogeneous and rational integral of system 
(1; A, 1), 


| 
| 
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Ro(x, y) 
So(x, y) 
ts the quotient of the division of homogeneous and whole integrals. 


Without any loss of generality, let us suppose the polynomials 
Ry and Sp to be prime. 


We have 
d (Ro 
that is, 
dRy 1 dS 
x 
or, simply, 
(7) 
dt 


where y is a polynomial. According to observation (a) of the intro- 
duction, » must have degree 1 of homogeneity, and therefore 


#1, M2, and ws being constants. Thus (7) is transformed into 
dRy 


(8) = + + Ro. 


According to observation (b) of the introduction, the polynomial 
R, which is obtained from Ry by permutation of x, yi, and x2, yo 
satisfies the relation 


dR, 


(9) = — + + Ri. 


From (8, 9) it follows that 


dRoR; 
dt 


(10) = (41 — — %2) RoR. 


Based on the same observation, the polynomial R, which is ob- 
tained from RoR, by substituting — x3, — 3 for x3, ys satisfies the rela- 
tion 


| 
| 

| | 
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dR: 
(11) (u1 M2) (x4 %2)R2. 
dt 
From (10, 11) it follows that 


and, consequently, RoR, R: is an integral of system (1; \X, 1). 

Finally, one can conclude that the rational integral Ro/S» may be 
considered as the quotient of the two homogeneous and whole inte- 
grals RoRiR2 and SoR 


Kowalewski’s case (1; 1, 1) [4]. It is known that the system 
(1; 1, p) admits the following four algebraic integrals 


1 
(12; 1, p) 2 X3+ pyit yo= hy, + + = he, 


2 2 2 
yiye + pys = hs, (px, — 2y1)(x2 — 2ye) = Ig. 


THEOREM 4. The system (1; 1, 1) does not admit any other algebraic 
integral distinct from the integrals (12; 1, 1). 


Let 
(13) f(*1, Vi, Va, Xe, X3, y2) 


be an algebraic integral of system (1; 1, 1). We shall prove that f is 
an algebraic combination of the integrals (12; 1, 1). 

The system (1; 1, p) admits, whatever be the value of p, the alge- 
braic integral 


f(px1, PY1, X2, Xs, 
or, owing to (12; 1, p), 
F(hy, he, hs, ha, p, Xs, ys), 


F being algebraic with respect to every one of the arguments. 

To prove that (13) is an algebraic combination of the integrals 
(12; 1, 1), it is sufficient to prove that F is independent of x; and y3. 

Let us, in the first place, notice that x; and y; are not integrals of 
the system (1; 1, 1), therefore F cannot depend only upon x; or upon 
yz. Let us then suppose that it depends upon x; and 43. 

One can always multiply F by a convenient power of p, so chosen 
that F, considered as a function of p, will not admit the point p=0 
neither as a zero nor as a pole. The algebraic function F of p is then 


| 
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developable, in the domains of the value p =0, according to the rising 
positive powers of p or of p'/?. In such a development the coefficients 
of the various powers of p are algebraic functions of hy, he, hs, he, X3, Ys. 

We may always suppose that for p=0, F will not be reduced to a 
single function of hy, he, hs, hs. In fact, let us carry on the develop- 
ment of F according to the powers of p’/”, until the first term whose 
coefficient be not reduced to a single function of fi, he, hs, hs. Let it be 


F he, hs, ha, p) + p*!?F he, hs, ha, p) + + 
Substituting for F the expression 


F be he, hs, ha, p) 
p*lp 


which means subtracting from the integral f an algebraic function 
of the classic integrals, we see that one may always suppose 


(14) F = Fo(hi, he, hs, ha, X3, ys) + +: 
where Fy is not independent of x3 and y;. 


The system (1; 1, 0) admits the algebraic integrals (12; 1, 0) and 
must also admit the following algebraic integral 


he, hs, ha, %3; Ys) 


which results from (14) for p=0. Let us seek for this integral. 
From (12; 1, 0) one gets 


2 
2y2 = — x 
(15) 
— Xe = he — — h(2h, — 4%3) 
where h=2[(4h3—hy)hs 
From the third and the last equations of (1; 1, 0) it follows that 
dys 
2y2 — X21 
Owing to (15), this equation becomes 


dx3 dys 


2h; — 28 hg — — — 


The solution obtained is 


ya(2hy — + f [h(2h; — — — x3) das = 


| 

| 

| 


it 


1950] FUNCTIONAL EQUATIONS 505 


which is a nonalgebraic integral of the system (1; 1, 0). 

This conclusion contradicts the fact of Fy being an algebraic func- 
tion, therefore the initial hypothesis that assumes F to be dependent 
upon x3 and y; is absurd. 

F being independent of x; and ys, it follows, as initially said, that 
any algebraic integral of the system (1; 1, 1) is an algebraic com- 
bination of the four integrals (12; 1, 1), that is, such a system does not 
admit any other algebraic integral distinct from the integrals (12; 
1, 1). 
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NOTE ON THE FUNCTIONAL. EQUATIONS 
S(xy) = f(x) + fy), f(x") = nf(x) 


JOSEPH MILKMAN 


Introduction. M. Augustin-Louis Cauchy in his famous Cours 
d’Analyse de l’école Royale Polytechnique (Part I, Analyse algébrique, 
chap. 5, p. 109), published in 1821, gives a beautiful proof that the 
only continuous solution of the functional equation f(x) +f(y) =f(xy), 
where f(x) is defined for all real numbers x, is the function f(x) =a In x. 
Cauchy’s proof reduces the equation to the Cauchy equation f(x) 
+f(y) =f(x+y). In 1905 G. Hamel in the Mathematische Annalen 
proved that the discontinuous solutions of Cauchy’s equation are 
totally discontinuous. In 1919 B. Blumberg proved that the only 
measurable function satisfying Cauchy’s equation is the function Ax, 
hence the only measurable solution of 


(1) f(xy) = f(%) f(x") = nf(x) 


is a In x. In the past thirty years there have been a number of papers 
on these functional equations where the domain of the variable has 
usually been the real number system. In 1946 Erdés proved that if 
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f(m) is additive and f(m+1) =f(m) then f(m) =c In m (Ann. of Math. 
vol. 47 (1946)). This implies Theorem I of this paper. Professor H. 
Shapiro of New York University suggested that a more elementary 
proof of Theorem I than Erdés’s should be possible and this led me 
to the proof of Theorem I, given below, in which I show that any 
solution f(m) can be imbedded in a continuous solution f(x), thus 
rendering the theorem a corollary of Cauchy’s. I wish to express my 
thanks to Professor Freidman of New York University for some sug- 
gestions he made regarding this paper. 


THEOREM I. If f(m+1)>f(m) and f(mn) =f(m)+f(m) hold for all 
positive integers m and n, then f(m) =c \|n m. 


Proor. If m/n, m and n are positive integers, f(m/n) =f(m) —f(n) 
since m=(m/n)-n. If m and n are relatively prime positive integers, 
we define f(m/n) =f(m) —f(n). If X is an integer and m and » are rela- 
tively prime, f(m/n) =f(m) +f(A) —f(m) —f(A) =f(Am) —f(An), so that 
if we define f(m/n) =f(m) —f(n) for any two integers m and n, f(m/n) 
is a function of the rational number m/n. 

If a, b, c, and d are positive integers such that a/b>c/d, then 
f(a/b) >f(c/d), since f(a/b) =f(ad) —f(bd). f(c/d) =f(bc) —f(bd) and 
f(ad) >f(bc). 


If a, b, c, and d are positive integers, 
flac/bd) = f(ac) — f(bd) 
= f(a) + f(c) — f(b) — f@) 
= f(a/b) + f(c/d). 
f(m) =f(m) +f(1), therefore f(1) =0. 

If b;>1 and {b;} is a sequence having 1 as a limit where J; is a ra- 
tional number, for ¢ sufficiently large f(b,) >f(b;) >f(1) =0. Therefore 
there exists a monotonically decreasing subsequence {}/ } of {6,} 
having 1 as a limit and a corresponding monotonically decreasing 
sequence {f(b:) } having a real number f as a limit. Suppose f+0. 


Let N be an integer such that Nf>f(2); let bY be a 6/ such that 
1<by then <2, 


= Nf(bx) 


or 


$2) 
N N 


f(bx) = 


t 
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contrary to the choice of f(b/ ) as a monotonically decreasing sequence 

approaching f as a limit. Therefore f =0. Since f was any limit point 

of the sequence {f(b,;)}, we have the result that for any sequence of 

rationals b;>1 such that b;—>1, f(b;)—0. If r is rational and positive 
f(i/r) = fl) — f(r) = — f(r). 

Therefore if and 5; <1, f(b;)-0. From the above we conclude 

that if b;—1, f(d;)—0. 

If {a;} is a convergent sequence of positive rationals f(a,,) —f(a,) 
=f(an/a,)—0 as | dm —a,| —0, for as | am —0, a,/a,—1, therefore 
{f(a:) } converges. Let lim;... a;=a. If a is rational, then a;/a—1 and 

f(a;/a) =f(a;) —f(a)—0 and hence f(a;)—f(a). If a is irrational, de- 
fine f(a) as lim;... f(a;). f(a) is unique for if a;—a, f(a;)—>f and if 
a} —a, f(a! )—’, then 


a; 

f(a:) — f(ai) = i(=) 

but a;/a/ —1, hence f—f’ =0. We have now defined f(x) for all posi- 
tive real numbers x so that when x;—>x, f(xi)—>f(x), that is f(x) is 
continuous. If x and y are any real numbers, let {x;} and {y,} be se- 
quences of rational numbers such that x;—x and y,;—y, then f(x:y;) 
=f(x:i)+f(y:) and hence f(xy) =f(x)+/(y) for all positive real num- 
bers x and y. The only continuous solution to the functional equation 


f(x) + f(y) = f(xy) 


f(x) = cln x. 
Therefore 
f(m) = clnn. 


THEOREM II. Jf f(x) is a monotonically increasing function defined 
for all positive real numbers x, such that f(x")=nf(x) for all positive 
integers n, then f(x) =k |n (x) where k=f(e). 


Proor. f(x) =nf(x"/") or f(x") =1/nf(x). f(x™") =(m/n)f(x), or if 
r is a rational number, f(x’) =rf(x). In particular f(e") =rf(e) =f(e) 
-In (e"). Let x be any positive real number and let y=In x, that is 
x =e". Let the sequence be a monotonically increasing sequence 
of rational numbers having y as a limit, then ex and {e"} is 
monotonically increasing and {f(e") } is monotonically increasing and 
has the limit of f(e) In x. If f(x) <f(e) In x, then there is an 7, such 
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that f(x) <f(e™") with x>e"* contrary to the monotonicity of f(x). 
If f(x) >f(e) In x, let {r:} be a decreasing sequence of rational numbers 
having y as a limit, then {ers} is monotonically decreasing to the limit 
x and {f(e")} is a monotonically decreasing sequence having f(e) In x 
as a limit. We can now find an r,, such that f(x) >f(e") with x <e™, 
contrary to the monotonicity of f(x). Therefore f(x) =k In x. 


Coro.iary. [f f(x?) = pf(x) and f(x) is monotonically increasing for 
all real numbers x>0 and all prime numbers p, then f(x) =k \n x. 


Coro.uary. f(x?) =pf(x) and f(x) ts monotonically decreasing 
for all real numbers x >0, and all prime numbers p, then f(x) = —k |n x. 


Coro .iary. If f(r) is a monotonically increasing function on a set S 
which is everywhere dense among the positive real numbers and contains 
all positive integral powers of all members of the set, for example, the 
rational numbers, and f(r?) =pf(r) for any prime p, then f(r) =k |n r. 


ProoF. f(r") =nf(r) for all r in S and positive integers n. Let R bea 
real number and let {r;} be an increasing sequence of elements of S 
having R as a limit, then if {r/ } is another such sequence, the limit 
L of {f(r3) } =the limit L’ of {f(ri) }. For suppose L > L’; then there 
exists an 7, such that f(r,) >L’, and anrj,>r,, hence L’>/(r},) >f(ra), 
a contradiction. Similarly, we arrive at a contradiction if we assume 
L’'>L. Therefore L =L’. We may thus extend the domain of f(r) un- 
ambiguously to all real numbers greater than 0, by defining f(R) to 
be the limit of {f(r,)} for any irrational positive number R. Under 
this definition monotonicity of the function holds in the new extended 
domain of all positive real numbers. Since R* is the limit of {rf i. 


r; defined above, f(R*) =nf(R) for any real number R and the corol- 
lary follows from the theorem. 


UNITED STATES NAVAL ACADEMY AND 
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TRANSITIVE SYSTEMS OF LINEAR OPERATORS 
ON A BANACH SPACE! 


BERTRAM YOOD 


Let ¥ be an infinite-dimensional Banach space. Mackey? has shown, 
under more general conditions, that if x1, ---,%n, ¥1,°°*, Yan are 
any two sets of linearly independent elements in %, then there exists 
an isomorphism T of ¥ with itself such that T(x;)=y;,7=1,---, m. 
In other words the collection of isomorphisms is transitive for linearly 
independent sets of elements of ¥. This property is shared by many 
other sets of linear operators, for example by the set of all operators 
with finite-dimensional range in ¥. These two sets of linear operators 
are semi-groups.’ In this note we give a condition for the transi- 
tivity property above which is necessary for all sets of linear operators 
and which while not in general sufficient is so for semi-groups. In 
this result the operators need not be assumed to be bounded or de- 
fined everywhere in ¥. €(X) will be used to designate the collection 
of all bounded linear operators with domain ¥ and range in %. 


THEOREM 1. Let & be a collection of linear operators with domain and 
range in an infinite-dimensional Banach space %. For © to be transitive 
for linearly independent sets of elements of X it is necessary that for 
each finite-dimensional subspace F of & there exists a T€@ which is the 
identity on F and a number e>0O such that if UCE(X) takes F into F 
and ||U|| <e, then there exists an operator VE@ which agrees with 
T+ U on F. If © is a semi-group, this condition is sufficient. 


It is readily seen that the condition is necessary. That it is not 
sufficient in general can be seen by taking © to be an e-neighborhood 
of the identity in the Banach space €(%). 

Let @ be a semi-group. Let F be a finite-dimensional subspace of 
% and T> be its associated operator in G, and let x1,---,%, be 
linearly independent elements of F. There exist x* in ¥* (the conju- 
gate space of ¥) such that =46;;, 1, 7=1, - - - , For elements 


Received by the editors December 28, 1948 and, in revised form, June 13, 1949. 

1 This paper is based on a portion of the author’s dissertation, Yale University, 
1947. The author wishes to thank the referee for his suggestions which made possible 
an appreciable shortening of proof of Theorem 1 and clarified the nature of that 
proof. 

2G. W. Mackey, On infinite-dimensional linear spaces, Trans. Amer. Math. Soc. 
vol. 57 (1945) pp. 155-207, see Theorem II-3. 


3 By a semi-group of linear operators is meant a collection which contains TU 
whenever it contains T and U. 
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F for which ||»; —x,|| <5=min {1, €} /2n max ||x¥\|, let the trans- 


formation W be determined by 


W(x) = — 2). 
i=] 

Then ||W||<min {1, e}/2. Thus T7)+W is an isomorphism on F. 
The operator U= bw (—1)*W* has the properties || U||<e and 
(To+ U)(To+ W)x =x for all x in F. By hypothesis there exist trans- 
formations U; and W, in @ which agree with 7)>+U and 
respectively in F. Then by the above W,(x;)=0; and U,(v;) =x;, 
i=1,---,m. If also for each i, <6, vf F, we may define 
Wi €@as above so that W{ (x;) =v/. Then Wi U,(0;) =v/ ,i=1, 
n. The semi-group property of @ shows that (a) for each F and 
1, °° *,X» linearly independent in F there exists a 6>0 such that 
for every v;, w; in F with ||v; —x,|| <6 and ||w;—-~x,|| <5, there exists V 
in with V(v;)=w;,7=1, ---, 

Next for a fixed F and mSdim F consider the Cartesian product 


space F=FX - - - XF (m factors) made up of elements of the form 
with =maxisn F is a finite-dimensional 
Banach space.‘ Let L be the set of all #€ F such that x, - - - , x, are 
linearly independent in F. By (a), L is open in F and to each point 
# of L there exists a 5>0 such that ||#—4]| <5 and \|z—-2 | <6 imply 
the existence of a transformation VE€@ such that V(é)=@ where 
V(s) =(V(m), ---, V(v,)). Let K be the set of § in L which can be 


reached from £ along a finite chain of such spheres each overlapping 
its predecessor. Then by (a) for such a f there exists a sequence 
Vi, ---, such that for the product W=V,--- Vi, W(#)=9 
and by the semi-group property, WEG. 

From this it follows by a standard topological argument, used in 
connection with analytic continuation and elsewhere,’ that the set 
K is open and closed in LZ. Therefore (b) if # and j are in the same 
component of I, there is a VEG such that V(2z) = 

Call # and fully independent if x1, ---,%Xn, Vi, °°», forma 
linearly independent set of 2 elements and suppose that 2, EL. 
For such a pair # and §, and for any real a, we have a¢+(1—a)jEL, 
for =0 implies 8;=0, i=1, - - - , n. Thus the 
line segment joining # and 9 is in L. Therefore (c) if 3 and fare ad 
independent in L, they must belong to the same component of L 


*S. Banach, Théorie des opérations linéaires, Warsaw, 1932, see p. 182. 


5 The use of the set K and of this argument was suggested by the referee to replace 
a more lengthy Heine-Borel argument. 
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If # and 4# are two linearly independent sets of » elements in X 
which are not fully independent, consider a set 4 of m elements 
which is fully independent of # and also of 7. Such a set @ exists be- 
cause ¥ is assumed to be infinite-dimensional. The subspace F can be 
taken to contain all the vectors x;, y;, and 2;. For the L which cor- 
responds to this F, %, §, and 4 must lie in the same component by 
(c). Then by (b) there exists a UE @ with U(z) = 5. This completes 
the proof. 

Next we consider all our transformations as being in the space 
€(X). For €(¥) the strong topology is defined, following the ideas of 
von Neumann,‘ as that where the neighborhoods of a transformation 
U, are those of the form 


N(Uo; Lk, €) 
= {T &(X)| ||T(«) — <e,i=1,---, k} 
where €>Oand x;€%,i=1,---,k. 


THEOREM 2. A subset U of €(X) satisfying the conclusion of Theorem 
1 is dense in the strong topology of €(%). 


It suffices to show that the neighborhood N(Uo; x1, - - - , xz, ©) 
contains an operator in &. By renumbering, if necessary, we let 
%1, - + +, x,bea linearly independent subset of the x’s which generates 
the same linear manifold as is generated by all of them. We put 
xi= 5-1 ---, kand A =max |a,;| for these values of 
and for 7=1,---,7r. Also we set »=min fe, e/(rA)}. We choose 
Wi, * ++, W, in ¥ where each w; is linearly independent of the ele- 
ments Uo(x:), 7=1,---,7, and of the previously selected w’s and 
each ||w,|| <<. The collection { Uo(x;)+w;} is a linearly independent 
set. By the assumption on YA, there exists TEW such that T(x;) 
= U,(x;)+wi, t=1,---,7. Thus for these values of 7, || T(x) 
— Uo(x;)||<e. For i=r+1, -- +, we have 


| 


| ||T(x;) — Uo(x,)|| <«. 


Then TEN( Up; x1, - + , Xe; €). 


CorNELL UNIVERSITY 


6 J. von Neumann, Zur Algebra der Funktionaloperatoren und Theorie des normalen 
Operatoren, Math. Ann. vol. 102 (1929) pp. 370-427. 
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ON LACUNARY DIRICHLET SERIES 
I. I. HIRSCHMAN, JR., AND J. A. JENKINS 


The following theorem is suggested by a result of S. Mandelbrojt 
[2, p. 101 ]* concerning lacunary Fourier series. 


THEOREM 1. Let f(s)= axe", where 
limi. An= ©. We denote by y. the abscissa of convergence of this series 
and by the abscissa of analyticity. Itis assumed thaty.< ©,Yy.>—@. 
Let v<1 be the exponent of convergence of the X;'s and suppose that 


| fsa + 0) | = Ofe-*] (o> 0 +) 
where Sa=Yattta. Then if p>v/(1—v), f(s) =0. 


We shall in what follows prove a more general theorem including 
Theorem 1 as a special case. The methods of the present paper are 
closely related to and in part derived from the work of L. Schwartz 
[4]. However no appeal is made to other than standard theorems of 
analysis. 

Let m(0)=0 and let m(c) be an increasing function defined for 
0<o Sa, a>0. A function f(s) which we may suppose analytic in the 
half-plane o>‘; is said to have a zero of modular order m(c) at 
+171 if 


| f(s: + 0) | S m(o) 


for some b>0. Let us define o as a function of p, ¢=7(p), by the equa- 
tion e~** = m(c). Since m(c) decreases as ¢ decreases to 0 this defini- 
tion is effective. It is n(p) which we shall use as the measure of the 
zero of f(s). As an example, if =exp [—(o~*) | then n(p) 

As the measure of the degree of lacunarity of our Dirichlet series 
we introduce 


re) = Slog (1 


If the sequence {,};° has exponent of convergence », then by a stand- 
ard theorem on integral functions [5, p. 251], 


$(p) = O(p’**) (p—> + 


Presented to the Society, September 2, 1949; received by the editors April 11, 
1949 and, in revised form, May 20, 1949. 


1 Numbers in brackets refer to the references cited at the end of the paper. 
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We may now state our principal theorem. 


THEOREM 2. Let ---, and let be 
defined as above. Let 


f(s) = 
k=1 
have abscissa of convergence y.< © and abscissa of analyticityy.>— ©, 


and let y.+i7. be a zero of f(s) of modular order m(a). Then tf n(p) 
is defined as above and if 


(1) lim inf — pn(e) = — 


we have f(s) =0. 


In the case considered in Theorem 1 we have, as we have seen, 
€(p) =O(p’**), n(p) The assumption >v/(1—v) shows that 
condition (1) is satisfied. Thus Theorem 2 does include Theorem 1. 

It is clearly sufficient to prove our theorem for y.=7T.=0. 

It is immediately verifiable that the function 


>0 
(1 +3) (> +pt+ez (z p) (p ) 


is analytic for x20, and that 


f | ka(a + iy, p) < (x = 0). 
By a simple extension of Plancherel’s theorem, see [3, p. 8], we see 
that if 


1 
$(n, p, ¢) = Lim.— eds (O0OSa< 
—iT Ra(\n + p) 


then 

(2) (x > 0). 
0 + 2, 

Further 


1/2 
=[ ) < [212,00 + 0, 


We assert that 
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(3) lim sup ||¢(n, p, < 0, 


We have 


o)|| 


1 Nn + 
—1 
An 


which proves our assertion. 
We define 


F(p, s) = 
We assert that if 


1/2 
0 


(4) lim sup ||F(p, o)||2e" < 


then 


Let M be a bound for f(¢) for 0<a < «. We have 


n(p) 1/2 1/2 
| | +| f(g ae | 


m(n(p)) + Meer) 
(M + 


IIA 


IIA 


which proves relation (4). 

It follows from a well known theorem concerning restricted over- 
convergence of Dirichlet series, see [1, p. 141], that there exists an 
increasing sequence of integers J, for which 


lim aj = 


— 
| 
| 
| 
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uniformly for e<a< © for any e>0. Thus if e>0, 


f p, o)F(c + €, p)do 
0 


= lim O(n, p, 
0 


j=1 
By equation (2) 
0, j#n, 


j=n. 


f O(n, p, g = { 
0 


Hence 


p, o)F(p, + €)do = 
0 
By Schwarz’s inequality 
| an| p, + «, 
If € is allowed to approach zero through positive values, we obtain 
Using equations (3) and (4) we see that 


S — pn(e) + C 


log Gn 


where C is a constant which depends on u but not upon p. If we allow 
p to increase without limit, assumption (1) of Theorem 2 implies that 


log | a,| = — ~, that is, a, = 0. 


Since this holds for each n=1, 2, - - - , f(s) =0 and our theorem is 
proved. 
We shall now prove that if assumption (1) is replaced by the 
weaker assumption 
lim inf cf(e) — pn(p) = — © 
where c<2-*/2, then Theorem 2 is false. 


Let us take for the constants Ay, O0<a<b<1<4<9< ---+.Asin 
Titchmarsh [5, p. 271] we find that 


(5) log | =)| 0). 


1 


—— 
| 

| 

| 
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It follows that 


2p 2p 
(6) $(p) = log (1 + *) + log (1 + 2) 
+ log (1+ 2) ~ (p— 
k=l k? 
We define 
1 
(7) dz. 


Deforming the line of integration of this integral to Rl z= —(m+1/2)? 
and allowing to approach infinity we obtain for s real and positive 
an expansion of f(s) into a Dirichlet series with exponents a, 3, 1, 
4, 9, - - - which then converges in the half-plane RI s>0. The de- 
tails of this are left to the reader. Again deforming the line of integra- 
tion, this time to RI z=(a/2c)?, and using equation (5), we see that 
if €>0 is fixed, then for ¢>0 sufficiently small we have 


1 — 2e x? 
| | exp [- =| M (co) 
4 


where 
1 dz 
M(c) = — 
| 
a b 
= o(1) 0 +). 


Thus we may associate with f(s) and the origin the modular order 
m(a) =exp [—((1—2e)/4)(x?/c) |. We immediately find that 


(8) n(p) = (1 — 


Comparing equations (6) and (8), we deduce that Theorem 2 is false 
if ¢<2-%/2, which is what we wished to show. 
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ON THE ABSOLUTE CONVERGENCE OF 
TRIGONOMETRICAL SERIES 


TAMOTSU TSUCHIKURA AND SHIGEKI YANO 
1. THEOREM 1. Suppose that a trigonometrical series 
> pa cos (nx — cn) (on 2 0, m = 1, 2,---) 
and its conjugate series 
Pn Sin (nx — ap) 
are convergent absolutely at x=xo and x=x,, respectively. If 
— = pr/g (p/q irreducible) 


where p is an integer positive, negative, or zero, and q is an odd integer, 
then 


Dd pa < =. 


We shall see in Theorem 2 that the above theorem is no longer 
true if p/q is replaced by p’/g’ with an even gq’ and p’0, or by an 
irrational number. 

PRooF OF THEOREM 1. If we put x1—xo9 = p2/q=h, then by Fatou’s 
theorem! 


(1) >» cos (n(x; — sh) — an) | < 
for every odd s. Hence from the identity 
sin (nx; — a,) = cos msh sin (n(x; — sh) — an) 
+ sin nsh cos (n(x, — sh) — aa) 
we deduce immediately that 
(2) D> pn| cos sin (n(x; — sh)) — an) | < @. 


Received by the editors October 18, 1948 and, in revised form, May 14, 1949. 
1 See, for example, A. Zygmund, Trigonometrical series, p. 134. 
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On the other hand from (1) we have 

(3) > cos nsh cos (n(x; — sh) — an) | < 

hence from (2) and (3) 


(4) pn| cos nsh| < 


if s is an odd number. 
If we replace s in (4) by an odd g, then we get 


q | 


which is the required inequality. 

We may also prove the theorem geometrically by using the cited 
Fatou’s theorem concerning the symmetry of points of absolute con- 
vergence, and the fact that if the series 


> pn cos (nx — an) 


converges absolutely at a point x’, then also it converges at the point 


x’ +7. 


THEOREM 2. For a given sequence {p,} of positive numbers such that 
©, and an irreducible fraction p'/q' with an even q’ or 
an irrational number d, we can construct a trigonometrical series 


> p» cos 
which is absolutely convergent at x= p'-13/q’ or x=Xm respectively. 


Evidently the conjugate series p, sin n,x converges absolutely 
at x=0. 
For the proof of the first part of the theorem, we put 


n, = (2v + 1)q//2 = 1,2,--+)s 


then, p’ being an odd number, we have 


cos n, — 
q 


cos (2v + =| = (. 


For the second part we consider the sequence 
{ } eee 


which is uniformly distributed in the interval (0, 27) by Weyl’s 
theorem; then we can find a strictly increasing sequence {m,} such 
that 
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2 
| cos mAwr| < 1/( 
p=l 


Py» 
> cos mArAr| ,? 
( > »») 
THEOREM 3. Let >\p,= ©, and one of the following conditions be 
satisfied: 
(i) Pn is non-increasing, 
(ii) pn/pm ts bounded uniformly for n=m>0, 
(iii) 1/p,=O(n?). 
If >op. cos (nx—a,) converges absolutely at x=xo, then it does not 


converge elsewhere; moreover the conjugate series > pn sin (nx —a,) does 
not converge anywhere. 


Thus we have 


The first part of the theorem is due to R. Salem.’ For the proof of 
the second part we suppose, if possible, that 


> pa sin (nx — ap) 
converges absolutely at x =x, then by (4) 
> pn| cos — x1)| < 
from which we deduce that 


by the argument used by Salem to prove the first part. This is a 
contradiction. 


2. Consider the trigonometrical series 
(5) cos (nx — an) (p, = = 1, 2,---). 


If >°p,= ~, then the set of absolute convergence points of (5) is of 
measure zero by the Denjoy-Lusin theorem. On the other hand by 
Theorem 3, (iii), if }°%_,; 1/p,=O(n®), the series (5) converges ab- 


2 This proof is shown by Professor Izumi. We can easily find a sequence {n,} 
and a set E of generalized Cantor type such that ArCE and |cos ny| <hi Pu)? 
(v=1, 2,-- - ) for every xCE, that is | cos nyx| < for every x in E which in- 
cludes Ax. Therefore we see that each of the series > pr cos mx and De» sin m,x has 
the perfect set of absolute convergence points. 

3 R. Salem, The absolute convergence of trigonometrical series, Duke Math. J. vol. 8 
(1941). 
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solutely at most at one point. As a counterpart of this theorem we 
prove the following theorem. 


TuHeEoreM 4. If ©, and p,=O(1/n), then the set E of absolute 
convergence points of (5) ts of a-capacity (0<a<1) zero.* 


For the proof, let E be of positive a-capacity (0<a< 1); then by 
the Salem-Zygmund theorem,’ there exists positive distribution 
uC Lip concentrated on E. If we put 


1 
du~ a + > (a, cos nx + b, sin nx), 


then 


ad, sin nx — 5b, cos nx 


1 nN 


and, u(x) being monotone, the left-hand side is of bounded variation. 
Hence by a theorem due to Zygmund® we have 


(6) > (| an| +| b,|)/n < 


Let us put u(H)=fadu for HE(0, 2x). By Egoroff’s theorem, for 
any e>0 (e€<1/6), there exist a set ECE and a positive number M 
such that u(E) >1—e and 


> pa| cos (nx — an)| << M (xE €). 
1 
Integrating both sides we get 


fel cos (nx — an) | du < Myu(€), 


n=1 


where 


4R. Salem and A. Zygmund, Capacity of sets and Fourier series, Trans. Amer. 
Math. Soc. vol. 59 (1946) pp. 23-41. The a-capacity of sets is defined as follows: The 
distribution of positive mass on the unit circle || =1, z=re*, shall be supposed to be 
a non-negative additive function of sets, from which we define a point function u(x) 
non-decreasing for 0Sx S27. It will be supposed that the total distribution u(27) 
—y(0)=1. If for a (Borel) set E, fedu=7"du, we say that a distribution » is concen- 
trated on E. A set E is said to be of positive a-capacity, if there exists a distribution 
» concentrated on E and Va sirdu(t)/|et—re=|@ is uniformly bounded in x as r—1. 

5 Salem-Zygmund, loc. cit. footnote 4. 

* Zygmund, loc. cit. footnote 1, p. 136. 


3 

| 

| | 
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2 cos (nx — an) | du 


IV 


fea — 2 cos 2a, cos 2nx + sin 2a, sin 2nx)dp 


IV 


1 — cos 2a, cos 2nxdu + sin 2a, sin 2nxdu 


ll 


1 008 f 


cos 2nxdu + sin f sin 2nxdu 


— cos f cos 2nxzdu — sin f sin 2nxdu 
_E 


IV 


2r 2r 
1 — e+ cos dan f cos 2nxdu + sin dan f sin 2nxdu — 2e 
0 


> 1 — 3¢ —| aon| —| dan. 
Consequently 
(7) 2M = — 3e-) — pa(| aon| +| ben ). 


On the other hand, by (2) and pz=O(1/n), +| den] ) 
=0( +| b2n|)/) < © which contradicts (7). 


THEOREM 5. If (5) <s convergent absolutely on a set of positive a-ca- 
pacity (0<a<1), we can divide the suffixes into two classes {n;} and 
{m;} such that 


For, we get (6) by the hypothesis, and then we can deduce the 
theorem by the usual argument. 


REMARK. Let yp be a distribution concentrated on E. If 


2r 1 
0 | om rei= | 


is uniformly bounded in x as r—1, we say that E is of positive 
logarithmic capacity. Salem* gave an example such that }'p,= 
and >°p, cos (nx—a,) converges absolutely on a set of positive 
logarithmic capacity. But for a-capacity such an example seems to 
be open. 


TéHoKU UNIVERSITY 


1 For example, see Zygmund, Sur la caractére de convergence des séries orthogonales, 
Matematica vol. 9 (1935) pp. 86-88. 
8 Salem, loc. cit. footnote 3. 
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ON DOUBLE FOURIER SERIES 
GEN-ICHIRO SUNOUCHI 


1. In the present note, the author proves some inequality theorems 
on double Fourier series. We begin with some preliminary lemmas. 
Let f(x, y) be an integrable function and let its Fourier series be 


f(x, y~ p A m,n(%, y) 
and let 


2v+H--1 


9) = Amn(x, 9): 


n=2” 


LemMa 1. Let Sy.m,n(x, y) be the m, nth partial sum of Fourter series 
of f(x, ¥); then 


( Sram n(2, yy)" 


Qn y/2 
< A, f dxdy, 
0 0 
Lemma 2. If y>1, 


2r 7/2 
0 0 
B, f f | f(x, y) 
0 0 
(x | y) ) dxdy. 
0 0 py 


These lemmas are due to J. Marcinkiewicz [2]. 


IIA 


IIA 


2. THEOREM 1. If y>1, 


2r 2r o © y/2 
0 0 


m=0 n=0 
cy f | f(x, y) |"dxdy, 
0 0 
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2r 2m | 2r 2r 
f f | max Som dxdy D, f | f(x, y) |"dxdy. 
0 0 | mn 0 0 


Accordingly so s(x, y) converges to f(x, y) almost everywhere, when 
F(x, y) EL? (y>1). 


By Schwarz’s inequality 


| S2™ 2” 2" |? 
n 


2141 441 


In view of Lemma 1, we get 


2r y/2 
0 0 
2r 1 1 


02 (2'24| somo» — Pasay. 


i=0 j=0 


Since | S we have 


i=0 j=0 


t=0 j=—0 l=j 


2. 


1 m n m n 
SLU 


AEE 


kei 
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sol?) 

m n p=0 
| 
| 
| dxdy 
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(25 > | Ax. |2(2*24)1/2 


k=i l=j 


t=0 j=0 


LM TM: 


Ax,1|?2*2! 


IL 


k=O 2” 2° 


By using Lemma 2, we get the first half of the theorem. The remain- 
ing part is an easy consequence of the former. 


3. Lemma 3. If k>1 and y>1, we get 


2r 2r vik 
(Sl 
0 0 
2r 2r ylk 
f f ( Lisl dxdy. 
0 0 


The proof is carried out analogously to Marcinkiewicz’s proof of 
Lemma 1, using the result of Boas and Bochner [1]. 


THEOREM 2. If R22 and r>1, 


2r ax ky yik 
(ee 
0 m mn 
ps 
4 Fy | f | f(x, y) |"dady. 
0 0 


Accordingly if f(x, (y>1), 


= (k= 1) 


almost everywhere. 


The proof is analogous to my previous paper [4], using Lemma 3. 
The last part of the theorem was obtained by Marcinkiewicz [3] in 
more generalized form. 


= 
| 
=> 
| 
| 
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Lemma 4. Jf r>1, then 


0 m mn 


The proof is easy (cf. the author’s note [5]). 


THEOREM 3. If y>1, 


| f(x, 9) |"dady 


2r 2r o © 
s6,f 
0 0 


m=0 n=0 
and 
2r 2r 
f f | f(x, y) |"dxdy 
0 0 


(EE 


The proof is immediate by Lemma 4 and Lemma 2. 
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ON THE STRONG SUMMABILITY OF FOURIER SERIES 
GEN-ICHIRO SUNOUCHI 


This paper consists of three parts. In the first part we complete one 
of the deepest theorems due to Littlewood and Paley [3, 4]. Our 
theorem was already proved by Marcinkiewicz and Zygmund [4]. 
But the proof given here is very simple and direct. In the second part 
a strong summability theorem is proved. It is the completion of our 
former theorem [5]. We use the method due to Zygmund [8] for the 
proof. Finally we prove a strong summability theorem concerning 
lacunary sequences of partial sums (Zalcwasser [6]). 


1. Let f(@) be an integrable function with period 27 and its Fourier 
series be f(0)~ (a, cos n0+b, sin nO), assuming for the 
sake of simplicity. If we put z=pe* and 


ta = - aor. 


n=1 


then (6) becomes the boundary function of the harmonic function 
Rd(z). Let us denote by p, g, and r real numbers satisfying 


©, 


and by A, B, - - - absolute constants, not always the same from one 
occurrence to another. 


THEOREM 1. Let f(0) be a function of the class L’, then 


#) — p)| |% d 


<B f | 
0 


2r 1 riq 2r 
(1.2) f {f (1 — p)**| $'(z) do < cf | o(e%*) 
0 0 0 


rip 


(1.3) bf “| rao s f {f (i — dé. 


(1.1) is due to Littlewood and Paley [4]. In order to prove (1.2) we 
put 


(1.1) 
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(6) = max (| ¢(2) |; € S(®)) 
where S(6) is a kite-shaped region.? Then |¢’| <A/(1—p). If we put 


1 1/2 
0 
then we have 


2r 1 rig l/r 
< { f (f — as} 
0 0 
2r l/r 
(4+. 
0 
< 


2r 
(3 f 
0 


Since 0 <2/q <1, if we apply Hélder’s inequality, the right-hand side 


becomes less than 
2/¢\ l/r 


By the maximal theorem due to Hardy and Littlewood the last term 


is less than 
l/r), 2/¢ 


< e( 


Thus we get (1.2) by (1.1). 
For the proof of (1.3) we put 


2 See Littlewood and Paley [4]. 
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Then we have 


(1 — p)| ao le 


2r 1 r/2 l/r 
< { f ( 
0 0 w 
2r 1/r 2r \ 
< {f | < Bf f or 
0 0 
1—p/2 2r p/2\ 
0 0 
1—p/2 2r l/r\ p/2 
0 0 
2r 1—p/2 2r l/r\ p/2 
0 0 
tk 


Thus we have 


2r l/r l/r 
0 0 


Hence by (1.1) we get (1.3). 


2. THEOREM 2. Let s,(e*) and 7,(e*) denote the partial sums and the 
arithmetic mean of Fourier power series of o(e*) belonging to the class = 
Hr, respectively. Then we have 


r/2 
A f | o(e*) < f (XI sen — 7,(e%) d0 
0 0 n=1 


(2.1) 
B | o(e#) | 


2r 1 riq 2r 
(2.2) f (x sa(e#) — ta(e*) do < cf | 


n=1 


lA 


"dé, 


n=1 


@.s) Df "| f = “ao | 


(2.1) is due to Zygmund [8, 9], and (2.2) is given in my former 
paper [5]. For the proof of the theorem we require the following 
lemmas. 


19 
- 
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LemMaA 1. Let fi, fo, - - - be a sequence of integrable functions and 
let s,,» denote the vth partial sum of the Fourier series of f,. Then 


where r>1 and m>1. 
The inequality is also valid for conjugate series. 


This lemma is due to Boas and Bochner [1]. 
If we write 


s(p, 0) = >> (a, cos + b, sin n6)p*, 


n=l 


f(e, 0) = (an cos nO + bp sin n6)p*, 


n=1 
then we have the following lemma. 


Lemna 2. Let s,,(p, 9) denote the sum derived from f,(0) similarly as 
S»(p, 0) from f(@). Then under the hypothesis of Lemma 1 we have 


( Snkn(Pns 9) In) <A ( | f.(0) “ae, 


where 0OSp,1. The sums s,,, may be replaced by the conjugate ones. 


This lemma is easily derived from Lemma 1 by Zygmund’s 
argument by a slight modification. 


Lemma 3. Let 0Sp,<1 and A, denote an arbitrary interval situated 
in (pn, 1). Then under the hypothesis of Lemma 1 we have 


(ELS, 


The sums Sn,, may be replaced by the conjugate ones. 


This is immediate from Lemma 2. 
ProoF OF (2.2). We have 


Sa(e*) — ra(e#) = — (e*)/(m + 1). 


= 
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By Abel’s transformation 
Sa (e#) = (pe) — (1 — p) (pe* 
y=0 
Therefore 


| sw (e#) [2 


IIA 


A Sx +(1- > p> | sf (pe#) ) 


lA 


s} (pe 


Let p,=1—1/(m+1) and A, = (pn, Pasi) and let J be the left-hand side 
integral of (2.2). Then we have 


rscf | sa (one) |* 
ntttpan 


io} 


= riq 


(a— l)n 


spf 


n 


n— > lq 
+ | (pe*) ‘dp) 


n 


ria 


cf" | 


IIA 


lA 


from (1.2). Thus we obtain (2.2). 
PROOF OF (2.3). We may write 


| 
| 


| | = 


"|. 


If we put p,=1—1/n, then we have 


| 

— 

— 

| 

| 
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py 
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(1 — p)?-*| |"dp 


Pp 
dp 


ont 
n=1 1 v=1 
<B> s+ | pati) + BL | 
n=1 v=1 n=1 n*P 
=P+0Q 
say. We have 
1 n n pl p’ 
| 
Ps 
ant +1 v=1 y=1 
1 n 
| 
<c> sal st |») 
aml 1°" 
1 n | sf Pp 
where 1/p+1/p’ =1, and 
1 Pp 
| 
Qs ( 
err y=n+1 
1 | sf |p pip’ 
(142/p)p’ p’(»—1) 
1 | sf |? p/p’ 
p’—2 p’(r—1) 
ani v=n+1 y=n+1 
p==l 
Since st| /(+1) = we get (2.2) by (1.2). 
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3. THEOREM 3. Let f(@) be a function belonging to L’ (r>1), and {pn} 
and {dn} be arbitrary increasing sequences of positive integers such that 


| 
| 
| 
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Then we have 


(3.1) f (21+. 


Especially if 


ria 
do < Af | (q = 2). 
0 


= — Pn); 
then 


(3.2) | sp, — < (m = 1) 


v=1 


V 


for almost all 6. 


The proof of (3.1) runs similarly to that of (2.2). In this case we 
have to replace A, and p, by 


— 1/pn, 1 — 1/Pn+1), if Pati < 
a, 1 — 1/90), if 2Pm 


and p,=1—1/(p,+1) respectively. 
(3.2) is obvious from (3.1) by Kronecker’s theorem. 
REMARK. The formula 


1/d(x) ~ cp'(x)/p(2) 


gives the relation between the strong summability factor d(x) and 
the lacunary sequence p(x). If we put d(x) =1, then p(x) =2?, which 
is nothing but the Kolmogoroff theorem [2]. If we put d(x) =x, 
x2, and so on we get p(x) =x*, 2°", and so on, respectively. 

We conclude with the following theorem which is proved by an 
argument due to Zygmund [7, 8, 9]. 


n 


THEOREM 4. If f(@) belongs to L’ (r>1), then for almost all 0, the 
sequence {1, 2,-°° } can be divided into two complementary sub- 
sequences {ny} and {my}, depending in general on 0, and such that 


Sp,, (8) tends to f(@), 
the series >. 1/dmy converges, 
where dn}, >.1/d,= and 
Pn/dn = O( Pasi — 
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Touoxku IMPERIAL UNIVERSITY 


ON THEOREMS OF M. RIESZ AND ZYGMUND 
A. P. CALDERON 


Several proofs have been given of the results of M. Riesz and 
Zygmund. 

(a) The conjugate of the Fourier series of a function f(x) of L®, p>1, 
is Fourier series of a function f(x) of the same class, and 


9 


| f(x) A, | f(x) 
0 0 


holds, A, is a constant depending only on p. 

(b) If the function | f(x) | log* | f(x) | ts integrable, the conjugate of 
the Fourier series of f(x) is the Fourier series of a function f(x) of the 
class L. Moreover, there exist two constants A and B such that 


li@larsa f "| | logt | f(x) | dx + B. 


In view of the importance of these theorems it may be of interest to 
give another proof of them based on a different idea. Actually it is 
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easy to see (and very well known) that they are equivalent to the 
following proposition about analytic functions. 

Let F(z) =u(z)+70(z), u(z)>0, v(0) =0 be a function regular inside 
the circle || <1. Then the inequalities 


(1) | v(re**) 


| v(re‘*) | dx 


IIA 


2r 
A, | u(ret#) |pdx; 23% < 
0 


cf” | u(re**) | dx 


IIA 


(2) 
+ bf u log udx — 2rDu(0) log u(0) 
0 


hold, C and D being positive constants and A, the same as above. 
Let us prove (1) first. The argument is based on the inequality 


(3) | sin ¢|” < A,| cos ¢|” — B, cos po 


valid for —7/2S¢S7/2, 1<pS2, A, and B, being two positive 
constants depending on p. 

The proof of it is simple. Since cos p6<0 for 6= +7/2, taking 
B, large enough we get —B, cos pp>1 in a small neighborhood of 
@= +7/2. Now | cos ¢| ? is greater than a positive constant in every 
closed interval interior to (—7/2, 7/2), so that choosing A, large 
enough we shall have A,| cos ¢| B, cos pp>1in (—7/2, 7/2) anda 
fortiori the inequality (3). 

To show (1) now, let us set 


F(z) = u = Rcos ¢; v= Rsin ¢. 


Owing to the fact that u>0, we may choose —17/2<¢<7/2. For the 
same reason, F(z) #0 and so F(z)? is regular in |z| <1. Integrating 
now along the circle | s| =r we get 


1 F(z)? 1 
| R? cos podx = R[F(0)?] = u(0)? > 0. 
Cc 0 


Zz 


Multiplying now (3) by R? and integrating, we obtain 


2 


2r Qr 
f | A, | u(re‘*) B, R? cos podx, 
0 0 0 
but since the last term is positive, we may drop it, and the desired 
result follows. A similar argument works in proving (2). Here in- 
stead of (3) we use 


as 


| 
(4) 
wh 
the 
R 
F 
we 
2a 
| 
} 


> 
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| sin ¢| < Ccos ¢+ D(cos ¢ log cos ¢ + ¢ sin 9g), 


which can be proved in exactly the same way. 
Again, since F(z) does not vanish in | z| <1, F log F is regular 


the 


re, and integrating along | z| =r we get 


Tt 


1 dz 1 al 
F(z) log F(z) “| = R[F log = u(0) log u(0), 
2 Cc 2r J 


and replacing 


R[F log F] = R[R(cos ¢ + i sin ¢)(log R cos ¢ — log cos ¢ + i¢)] 


we 


1 
2a 


= Rcos ¢ log R cos ¢ — R(cos ¢ log cos ¢ + ¢ sin ¢), 


obtain 


f R(cos ¢ log cos ¢ + ¢ sin ¢)dx 
0 


1 2r 
~{ u log udx — u(0) log u(0). 
0 


Oe 


Finally, multiplying (4) by R and integrating, we obtain 


as 


2 


| v(re’*) | dx < | s(re*) | x 


+ pf u log udx — 2xDu(0) log u(0), 
0 


A 


stated in (2). 
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SUMMABILITY AND ANALYTIC CONTINUATION 
Vv. F. COWLING! 


1. Introduction. In this paper we describe a new family of Toeplitz 
summability methods, and we study the regions in which these 
methods sum a Taylor series to the analytic continuation of the 
function which it represents. 

Let A =(damn) and x= { Sm } (n, m=0, 1,---) be a matrix and a 
sequence of complex numbers, respectively. We write formally 


(1) t, = A,(x) = >> damm, 
m=0 


and say that the sequence x (and the corresponding series 
>-0 (Sm—Sm—1), With s_;=0) is summable A to the sum ¢ if each of 
the series in (1) converges and lim ¢, exists and equals ¢. We say that 
the method A is regular provided it sums every convergent sequence 
to its limit. The method A is regular if and only if 


(2) | anm| (s = 0,1,---), 
m=0 

(3) lim dam = 0 (m = 0, 1,---), 

(4) lim >> dam = 1, 


m=0 
where K is a constant independent of m (cf. Toeplitz? [2]). 


2. The methods F’. For each constant r (r #1) the element of the 
matrix A =F’ shall be defined by the equations 


(5) Gnm(r) = 0 (m <n), 
Onm(1) (1 (m =n). 
We note first that, for »=0,1,--- and |r| <4, 


= (1 — = 1 
m=0 
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and 


1— ntl 
= (255) 


1—|r 


If on the other hand |r| then |dan(r)| =< 
(n=0, 1, - - - ). Therefore the method F is regular if and only if r 
is real and satisfies the condition 0<r<1. 


THEOREM’ 2.1. Jf AB denotes the matrix product of A and B, 
Fu = = Fr, where r=r3+re—nife. 


Let F=(dam), FO F*=(Cam) and FF" =(y,,,). Then 
Cam=0 when n>m. When 


Cram = OnkDim 
k=0 


k=n 


r n r k 


k=0 T2 


n+1 m—n 


(i — — 12) + £2 — 
(1 —(n+r— ris) + re — rire) 


Since the last expression is symmetric in 7; and r2, Yam=Cnm, the 
proof is complete. We note the corollary thet, for r#1, the matrix 
F’ has the inverse F*, where p= —r/(1—7). 


m—n 


THEOREM 2.2. Jf r; and rz are real constants (OS11<1r2<1), every 
bounded sequence summed by F" is summed by F*, and to the same 
sum. 


Let then the method is regular, and 
= FrF", by Theorem 2.1. The present theorem now follows from a 
remark by Agnew [1] (cf. p. 328). 


3. Summability of Taylor series. Henceforth, {un} and {sm} 
shall denote sequences of complex numbers related by the equations 


Sm = Uo + + +++ + (m = 0,1,---). 
Lemma. If 


3 The author is indebted to A. Wilansky for helpful discussion regarding this 
result. 
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(i) the series um 2” has a positive radius of convergence R; 
(ii) | r| <R; 
(iii) ta (n=0, 1,---); 
(iv) Ta=tot+ht+ ---+t, (n=0,1,---); 
then 


m=n 
Equation (6) asserts that a certain sum of Taylor series is equal 
to another Taylor series. All the series in question converge because, 
for any fixed positive integer m, the respective radii of convergence of 
the four series 


> Umz™, Su3%, Cu > CmnSm2™ 
m=0 m=0 m=0 m=0 


are equal. The validity of the equation will now be established by 
induction. 


For n=0, the equation (6) reduces to the identity 
(1—r)>> sar™ = >> 
m=0 m=0 
But if the equation holds for n=k, that is, if 


1 — 


k Cu 
r m=k 


then 
Tesi = Te + tess 


1 — r\*1 
= ( ) Cm + Cm k41(Sm — 


r m=k m=k+1 
1 — r\*! 
( ) { [Comer + Cm, Cosas 
r m=k+1 


+ = 


«= 

r m=k+1 


m=k+1 


-- 

| 

| 

} 


yf 
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In other words, equation (6) also holds for n—k+1. This proves the 
lemma. From the lemma it follows that if 0< | r| <R, where R is the 
radius of convergence of the series }um2™, the relation 


1 n 1— mt+1l 


n 
n=0 r m=n r m=n 


holds in the sense that the existence of either member implies the 
existence, with the same value, of the other member. The following 
result is now immediate: 


THEOREM 3.1. If the series > >"_o um2™ has a positive radius of con- 
vergence R, the series ) Um is summable F* to the sum L for any constant 
r (0<|r| <R) for which the series 


1 n @ 
( ") > 


m=0 rT m=n 


converges to L. 


4. Analytic continuation by means of the methods F’. Let D bea 
simply connected region in the complex plane, C a simple closed 
Jordan curve lying in D and bounding the finite region D’. Corre- 
sponding to each complex number r (r#1) we define an open set 
R(r, C) as follows: R(r, C) is the set of all points in D’ for which 
|z—rz| <|t—rz| whenever ¢ lies on C. A set B shall be said to be of 
type R*(r, D) if B is closed and if C can be chosen in such a way 
that B is a subset of R(r, C). In the following theorem, f™(rz) de- 
notes the nth derivative with respect to w of f(w) at the point w=rz. 


THEOREM 4.1. If f(z) is uniform and regular in the region D, r is a 
complex constant (r~1), and B is a set of type R*(r, D), then the series 


(7) f™(r2)(1 — 1)*2"/n! 


converges to f(z) uniformly in B. 


If z lies in B, 


1 t)d 1 t 1 


t—2 1 — — rz)/(t — rz) 


1 t 
t— \t — 


where C is a curve in D such that B is a subset of R(r, C). The series 


| 
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in the last integrand converges uniformly with respect to ¢ on C and 
zin B, because, for such values of ¢ and z, 


| (z — rz)/(t — rz) | <k < i, 
where the constant K depends only on B and C. Therefore 


2ridc (t — 
But the point rz is inside of the curve C; for otherwise the line seg- 
ment joining the points z and rz would meet the curve C at some 
point ¢’, the inequality | (z—rz)/(t’—rz)| <1 would not be satisfied, 
and the point z would not lie in B. Therefore the last integral has the 
value 27if™ (rz) /n’, and the theorem is proved. 

The hypothesis that the region D is simply connected will now be 
removed. Let D be a connected open set, E its boundary, and ra 
complex constant (r#1). By Q(r, D) we shall denote the set of all 
points in D for which the inequality |z—rs| <|t—rs| is satisfied 
whenever ¢ lies in E. 


THEOREM 4.2. Let the function f(z) be regular and uniform in the 
bounded region D, and let r be a complex constant (r #1). Then the series 
(7) converges to f(z) absolutely in Q(r, D), and the absolute convergence 
ts uniform in every closed subset of Q(r, D). 


Let B be a closed subset of Q(r, D), and z a pointin B. It is to be 
shown that the series 


(8) — 1)"s0/n! 

n=0 
converges to f(z), and that the absolute convergence of the series (9) 
is uniform with respect to 2 in B. 

We observe that (8) is the Taylor series of f(z.) about the point 
2=rz.‘ Since B is a closed subset of Q(r, D), there exists a positive 
number e such that | (20 —r20)/(t—r20) | <1—2e when % is in B and 
t is on E. The Cauchy estimate for the coefficients of our Taylor 
series gives the result 


| £(r20)/n!| < M(r20)/[(1 — 


where 4, is the distance from rz to the set E and M(rz)) denotes the 
maximum modulus of f(z) on the circle |z—rzo| =(1—e)d. As the 


4 The author is indebted to A. M. Gleason who pointed out this fact in an oral 
communication. 
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point 2) ranges over the set B, the points 
z= rzo + — ©)dg (0 $6 < 2x) 


range over a subset of D which is bounded away from the set E, 


and therefore the quantities M(rz) have a common finite upper 
bound M. It follows that 
2” — 
()(rz9)(1 — 


and the theorem is proved. 


<(1—)*M, 


THEOREM 4.3. Let D be a connected open set containing the origin, 
f(2) a uniform function regular in D; let the series > am 2™ converge to 
f(2) in a circle of radius p; and let r be a complex constant (r~1). Then, 
in every closed set which is contained in the intersection of the set Q(r, D) 
and the circle | z| <p/|r|, the series > an2™ is uniformly absolutely 
summable F* to f(z). 


If r=0, the set Q(r, D) is contained in the circle | z| <p, and the 
result is trivial. To prove the theorem for r~0, we observe first that 
the series 


a_,m(m — 1)--- (m— n+ 1)(rz)™* 


converges to f)(rz) in the region |z| <p/| r| . It follows that the series 
(7) can be written in the form 


n=0 Tr 


m=n 


and upon application of Theorem 3.1 (with u,=a.2") the present 
theorem becomes a corollary of Theorem 4.2. We note that we have 
established analytic continuation of the function represented by the 
series } a," by means of summability methods which need not 
even be regular. 


5. Summability of a special series. We now subject the methods 
F* to the customary test of applying them to the series }>2*. Here, the 
sole singularity of the function f(z) is the point z=1, and the bound- 
ary of the region D can therefore be taken to consist of the point 
z=1 together with the circle | s| =K, where K is arbitrarily large. 
We shall restrict our considerations to the case where F’ is regular, 
that is, where r is real and 0<r<1. If zis any point in the plane, the 


| 
| 
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inequality |z—rz| <|t—rz| is certainly satisfied if ¢ is sufficiently 
large. It remains to examine the inequality for the case where ¢=1. 
Here we have, with z=x-++7y, 


(x? + y*)(1 — r)? < (1 — rx)? + r’y?’, 
(1 — 2r)(x? + y?) < 1 — 2rz. 


Theorem 4.3 now gives the following result: If 0<r<1/2, F’ sums the 
series } 2” to the function 1/(1—z) in the intersection of the region 
|z| <1/r with the interior of the circle having its center at z 
= —r/(1—2r) and passing through the point z=1. If 1/2<r<i, F* 
sums the series in the intersection of the region | z| <1/r with the 
exterior of the circle having its center at z=r/(2r—1) and passing 
through the point z=1. The method F'? sums the series in the inter- 
section of the region | s| <2 with the half-plane x <1. 

Finally, we recall that in §2 the relation F*DF" (0S7<r2.<1) 
was shown to hold in the space of bounded sequences. The relation 
does not hold in the space of all sequences. For we have now estab- 
lished that the series }°>(—5/3)" is summable F’?; on the other 
hand, the transform of this series by F’ does not even exist when 
r2=3/5. 

Added in proof: After the present paper had been accepted for pub- 
lication, an interesting paper by P. Vermes [Amer. J. Math. vol. 71 
(1949) pp. 541-562] appeared which overlaps the results of the 
present paper. In addition R. P. Agnew [Math. Rev. vol. 11, p. 242] 
in a review of a paper on these methods by W. Kénig-Meyer [Math. 
Zeit. vol. 52 (1949) pp. 257-304] points out that the first systematic 
study of these methods is in the thesis of R. Wais [Das Taylorsche 
Summierungsverfahren, Tubingen, 1935 ]. 
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A BESSEL FUNCTION EXPANSION 
MORRIS KLINE! 


1. Introduction. Some work in the theory of electromagnetic wave 
guides led to the following problem, namely, to obtain a useful ap- 
proximation for the expression 


(1) JARR)N(RR + ka) — + ka)N ARR), 


wherein J, and N, are the Bessel functions of the first and second 
kind of order v, and under the conditions that R be large and that 
v=dR, where d is any real constant. It is perhaps worth while to 
record the result, particularly because it is obtained by rather ele- 
mentary means. The result is a Taylor’s expansion of the expression 
(1) about R= = as follows: 


JARR)NARR + ka) — J,(kR + ka)N,(kR) 


(2) a* (sin x 1 
wherein x? = — k?a*¥, y = (d?/k?) —1 and y=dR. A similar result is also 
noted below for the expression: 
(3) J; (RR)NJ (RR + ka) — J; (RR + ka)N? (RR). 


Physical interpretation of the two results is given in §3. 
It might be remarked that it did not appear possible to obtain any 


simple result with the usual asymptotic expansions for the individual 
Bessel functions. 


2. The derivation of the basic expansion. Let a=ka and B=kR in 
(1) and let us denote it by G(a, 8). We shall, for the time being, re- 
gard the order v of the Bessel functions J, and N, as fixed and expand 
N,(8+a) and J,(8+a) in powers of a at the point 8. We obtain: 


G(a, 8) = J,(8){ N.(8) + aN! (8) + (a2/2) Ni (6) +--+} 
— N,(8){J(8) + oS} (8) + (o#/2) +--+ }. 


The functions J,(z) and N,(z) are analytic except at z= © and, in the 
case of the second one, at z=0. The Taylor series representation is 
therefore valid for all values of a and 8 chosen subject to these restric- 


(4) 
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tions; the series must converge absolutely; and by an allowable re- 
arrangement of its terms G(a, 8) may be written as 
(5) G(a, B) = 2) —Don(8) 
n=1 
where we have adopted the notation, with respect to any variable z, 


(6) Dj,i(2) = — Jv'() Nr (2), 

the superscript [i] meaning the ith derivative with respect to z. We 
shall investigate the expression (5) with a view to ultimately finding 
those terms which may justifiably be regarded as the leading terms 
in 1/R of the expansion (5) under the condition v=dR.? 

Let u(z) and v(z) denote any two independent solutions of the 
Bessel equation of order v. Then u’’ = —z-'u’+2-?(v?—2?)u, and v”’ 
= —z-y’+2-*(v?—3?)v. Substituting these equations into D2, gives 
immediately that D2,,= If we set: 


(7) y = — = — 1, 
we may write 
(8) pD,,+ + yDo,+. 


It is immediate, from equation (6), that if we let the symbol 6 
stand for differentiation with respect to z then: 


(9) = + Di, 
By use of formulas (9) and (8) we readily establish that 
(10) Don (26 + p)Do,n-1 + (y — pb — 5?) Do,n—2- 


We introducethe notationA, = (1/2) Do,n, the factorz/2 being chosen 
to cancel a 2/z which will appear later in every Do,,. Also we intro- 
duce the operators: 


(11) o=25+p and r=y-— pi — 8. 
Then relation (10) above becomes the basic recursion formula: 
(12) An = + TA,~-2. 


The formulas for A, for the first few integers are well known,? and 


2 A special case of expression (5) for the case »=0 is used in Schelkunoff, S., The 
electromagnetic theory of coaxial transmission lines and cylindrical shields, Bell System 
Technical Journal vol. 13 (1934) p. 532 ff. See p. 562 in particular. 

3 For n=0, 1,---,5, the function Do, can be found in Watson, G. N., Intro- 
duction to the theory of Bessel functions, New York, Macmillan, 1945, p. 76. Formula 
(12) provides a convenient way of deriving as many more as may be needed. 
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easily derived. The first two are Ag=0 and A,;=p. From the first 
two, and the recursion relation (12), we see that A.= —p’, since p is 
z-'. In general, the expression A, is a polynomial in p and » of increas- 
ing order in p and v. It is also a polynomial in p and y=(v’p?—1). 
The form of these polynomials becomes exceedingly complicated 
with increasing 1. 

We are interested in the dependence of G(a, 8) on 1/R as R>~ 
and v=dR. To obtain this relationship we intend to let 1/p=z=kR, 
whereupon the combination vp, and therefore ¥, will be of zero order in 
1/R. The variable p itself will be of first order in 1/R. Guided by 
these facts we shall seek out in each A, the terms of total orders one 
and two in 1/R, discarding those of higher order. 

We shall prove by induction that for all n21 


(a) ~ py”; 
(13) 
(b) Asn ~ — n(2n — 1)p*y*"! — 2n(n — 1)p*y*"?, 


where the explicit terms on the right are exact, and where all terms 
omitted are of the form Pp*t*y™, s>0, P a function of m. There are a 
finite number of such terms (for each m) and each of them, as re- 
marked, is of at least third order in the ultimate variable 1/R. 

By the definitions of ¢ and 7 it follows that for every m20, 


(i) ofpy™} = — (1 + 4m)p*y™ — 


(ii) ~ (iii) o{p*y™} ~ 0; 
(14) 
(iv) t{p*y™} ~ 
(v) ~O and r{ptt2y™} ~0, s>0; 


the symbol ~ being understood as directly above. 

Turning now to formulas (13), note first that they are true in the 
special case »=1, and we must verify that if they hold for a given n 
then they hold also for +1. By the recursion formula (12), Aonas: 
=oA2,+7A2,1. Assuming that formula (13) holds for n, we obtain 
Aenyi = —n(2n—1)o { } —2n(n— 1)o {py*}. There- 
fore, by formulas (14), Aoii~py”. Notice that in the case n=1, 
where the middie term involving y*~? might seem disquieting, this 
term is not actually present because of the factor (m—1) in its coeffi- 
cient. This proves formula (13a) for Aen+:1. 

To prove formula (13b) for Asay2 we write formula (12) in the 
form Aons2=GA2n41+TA2,. Now we substitute for A2,;: from the pre- 
ceding paragraph and for A», from formula (13b). Then, Asay. 
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~o {py"} +7 { —n(2n—1)p*y*"! — Finally by for- 


mulas (14), This con- 
cludes the demonstration of both formulas (13). 

We now let z=8 (or kR) and substitute the approximations to A, 
given by formulas (13) in the series (5), after replacing Do,, by 
(2/mr)A,. The use of approximations (13) in the terms of this series 
amounts to regarding the series as one in two variables a and 8 and 
to a reordering of the terms. We postpone the justification for this 
step‘ and turn at once to the series. 

First we separate )-<_, a*/n!A, into the sum of an “odd” and 
“even” series, and a?™/(2m) !Aon. 
Then we substitute for each of the A’s the value from (13). Hence 

= 


= 
1 a?™ 


+ remainder. 


[m(2m — + 2m(m — 1)y""?] 


Each term of this remainder is of the form (1/8*)f,(a, y), s2=3, with 
f, an appropriate series in a whose coefficients are polynomials in y. 

We introduce x? = —a’7; as we shall see later, x is real and positive 
in applications. By simple manipulations using, in the last series, the 
relation n—1=(2n—1)/2—1/2, we may express this as 


a = 1) m—1~2m—1 a? (- 1) m—1 2 m—2 
xB (2m — 1)! 2B? mai (2m — 2)! 


1 a? 
2 xB? (2m — 1)! 


a? 


Qyp? (2m — 2)! 


+ remainder. 


Substituting the trigonometric functions appropriate to the sum- 
mations gives 


a” _ 2a sin x 


2 
G(a, = — 
(a, 8) or x 


2a? [sin x 1 
=| ~ (1 4+ -) cos + remainder. 
7B? L yx Y 


‘ The justification is given in the appendix. The appendix also proves that the 
expression (2) is a Taylor’s expansion about R= ~. 


(15) 


| 
| 
‘hia 
| 
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This is the result given in formula (2). 

The notation in formula (6) and the formulas (12) and (13) are 
adequate to treat expression (3). If we again let a=ka and B=kR 
and denote the expression by I'(a, 8), we obtain 


(16) a? sin x ¥ sin x 
+ + 1) + cos r) + | 
x 


x 


+ remainder. 


oo 


Fic. 1 


3. Physical interpretation of the result. Given a sector of a coaxial 
guide, the cross-section of which is indicated in Fig. 1, the longitudinal 
component of the electrical field supported by such a guide is given 
by the expression® 


(17)  E, = [KN,(kR)J,(kr) — KJ,(RR)N,(kr)] sin vpei#t™ 


where r, ¢, and z are the cylindrical coordinates employed, v= Rmz/b, 
and b=Rq@o. The quantity K is determined by the amplitude of the 


5S. Ramo and J. R. Whinnery, Fields and waves in modern radio, New York, 
Wiley, 1944, pp. 325-326 and pp. 333-335. 


2 ay 
(a, 8B) = — —— 
xr 
\ b 
» | 
\ 
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field excited in the guide. k is determined by setting the left side of 
equation (2) equal to 0. This equation yields an infinite number of 
values of k for any one value of v, each value of k thus determining 
one mode of the guide. 

It is, incidentally, true that setting the right-hand side of equation 
(2) equal to zero gives an implicit equation for & from which it is 
possible to determine k to two terms. However, this value of k may 
also be obtained in other ways. We shall merely note for later refer- 
ence that if we do set this right-hand expression equal to 0, multiply 
by R, and let R become infinite, we obtain sin x=0. Since 
x= we must have a(k?—d?)?=nz, n=0, 
+1, +2. However d=mz/b and so = (n?x?/a?+ /b?) "2, 
which is the formula for the cutoff frequency of a rectangular wave- 
guide supporting the 7M,,,, mode. 

If one holds a and 6 fixed and allows R to become infinite while 
R¢go=6, one sees geometrically that the coaxial sector becomes a 
rectangle with dimensions a and b. The physical interpretation of the 
basic result (2) is made in terms of this rectangle as follows. As R 
becomes infinite the factor sin »p=sin Rmmd/b in expression (17) 
approaches sin mmy/b, where y=Rq@. The limiting value of the 
factor involving the Bessel functions is given by the result of this 
paper; for, by letting t=r—R this factor becomes, by virtue of 
formula (2), 


2K ¢ sinx K # (sinzx 1 \ 
+= (1+ — cos 


(18) 
+ terms of higher order in >’ 


where x? is now —k*t*y and y = (d?/k*) —1. To determine the limit of 
this expression it is necessary to normalize it, for the field strength 
should not be dependent upon R. Normalization may be accom- 
plished by requiring the amplitude of the factor (18) at r=R+<a/2 to 
be 1 for each value of R. Then 1/K=N,(kR)J,(kRR+ka/2) 
—J,(RR)N(RR+ka/2). Again by formula (2) of this paper, wherein 
we now let a be a/2, and by simple reductions and the taking of a 
limit as R becomes infinite, we find that expression (18) has the 
limiting form L sin (k? —m*r?/b?)/2(r — R), where L is independent of 
R and ¢. The k here is k| x-.. which we have seen has the value 
n*x*/a*+m*x?/b?. This result shows that expression (18) permits us 
to exhibit the field in a coaxial sector as a function which approaches 
the field in the rectangular guide approached by the sector as R be- 
comes infinite. 
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APPENDIX 


This appendix will supply several theoretical details needed to 
complete the mathematics of the paper. In particular, it will justify 
the rearrangement of terms made in §2. This rearrangement con- 
verted a power series in a, with coefficients dependent upon v and 
p, into a power series in p with coefficients dependent on k and d and 
subject to the condition that v is a constant over p. As the body of 
the paper shows, when v and p are replaced by their values in terms 
of R, the combination vp is independent of R, the latter being the 
essential variable for the purpose of this paper. 

Actual calculation of the quantities Az, A;, Ay, - - - , by operating 
with the basic recursion formula (12) of the paper, indicates that A, 
for even m can be expressed as a polynomial in even powers of p with 
coefficients which are polynomials in even power of v. Likewise, for 
odd n, A, can be expressed as a polynomial in odd powers of p with 
coefficients which are polynomials in even powers of v. More formally 
stated, we have the following theorem. 


THEOREM A. 


n—1—i 


n—1 
2n—1 2n—2s—1 > 2n-1 2n—1 2(n—1—i—j) 
= Pip wherein P; = >, a 


4,7 
i=0 j=0 
and 
Aon(p) = >. Pi p wherein P; = >, av 
i=0 j=0 


Moreover, the coefficients af; satisfy the relations 


a n— n— 
2n+2 2n+1 
= — (4n — 44+ 1)a;,; 
(b) ( )ai, j 


2n 2 2n 2n 
+ — (28 — 2% — 2) — 


provided that when i+j>(m—1)/2, or or 7<0, or 


The proof of this theorem is readily made by mathematical induc- 
tion utilizing the recursion formula (12). 

It is now necessary to investigate the pattern of signs of the coeffi- 
cients of the polynomials Pf with a view to showing that we are deal- 


ing with an absolutely convergent series. We next prove the following 
theorem. 


— 
| 
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THEOREM B. (1) The sign of af,=(—1)"**'!, m=1, 2, 3,---; 
0<i<(m—1)/2. 

(2) The sign of aj—i+ (2n— 2i— 2) 3=(—1)', i+jsn-1. 

(3) The sign of i+ = (—1)*1, i+-jsn-1. 


ProoF. It is (1) only, of Theorem B, which will be needed ulti- 
mately; however, the nature of the proof forces us to be concerned 
with parts (2) and (3). The proof is by mathematical induction. By 
inspection of Aen-1, and when we find all three state- 
ments to be true. We now assume these statements true for 2”, 2n—1, 
and 2m—2 and proceed to the main step of the inductive argument. 

It follows from relation (a) of Theorem A, with j replaced by j—1 
and the terms regrouped, that for all i, 7 


@;,j-1 + (2n — 2i)a,. j-2 
(a) = — (2n — 21 — 1) {a; 4-2 + (2n — 21 — 1)a;-;-2} 
2n—1 2n—1 
+ @j,j-1 — G_1,;-1. 


By the induction assumption (3) the sign of the first term on 
the right is (—1)**?=(—1)‘; the sign of the second term, by as- 
sumption (1), is (—1)?*~-'**+!=(—1)‘; and the sign of the third term 
is (—1)!+@»—-D+(-D+1 = (—1)*, It follows that assumption (2) holds 
for n+1. 

By the induction assumption (1) the sign of afj_. is (—1)**! and 
the sign of the entire left side of (a) is (—1)* by the previous para- 
graph. It follows therefore that the term a7*} on the left side of (a) 
has the sign (—1)*. This step completes the inductive proof of state- 
ment (1) for the case where m=2n+1. 

To establish statement (3) of the theorem we use relation (b) of 
Theorem A. If in that relation we replace 7 by j—1 and rearrange 
terms slightly, we obtain 


2n+1 


— (2n 2i)a, j-1 — (2n — 21 — 4 
— (2n (2n — j-2 2} 

+ 2(4n — 41 — 2) 4-2 + * 


Since we have already established relation (2) for +1, it follows 
that the sign of the first term on the right is (—1)**!. This is also the 
sign of each of the remaining terms in view of the inductive assump- 
tion for relation (1). Hence the right side and therefore the left side 
has the sign (—1)**+!, which establishes relation (3) for m+1. At the 
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same time, since a73*; has the sign (—1)‘ by the inductive proof of 
relation (1) when m is odd, it must be that the sign of &*t is (—1)*! 
and this is the proof of relation (1) for m=2n+2, which completes 
the entire proof of that relation. 

In consequence of (1) of Theorem B, the coefficients af, in the 
polynomials P? in Theorem A are all of one sign, depending on m 
and 7, and we obtain the following corollary. 


Coro.iary I. 


n—i-1 
i_2n—1 2n—1 2(n—1—i—j) 
(-1)P; => la; |v 
atts 
i+1_2n 2n 2(n—1—i—j) 
(—1) P; = > 
j=1 


We now let p=(—1)*r and substitute this and the results of 
Corollary I into the expressions for Az,1 and As, of Theorem A. 
Further we let a=(—1)'/?A. Then we have the following corollary. 


II. For a=(—1)"*A and p=(—1)"*r7, —aA,,(9) 
=A™Q,,(r, v), m=1, 2, 3,---, where the coefficients in the poly- 
nomial Q,, are all positive. 


We are now near the conclusion. We recall that the Bessel functions 
J,(z) and N,(z) are analytic functions of the argument z everywhere 
except at z= ©, and for the second of these functions at z=0. It follows 
at once that G(a, 8) = J,(8)N.(a+ 8) —J,(a+8)N,(8) is analytic in 
and excepting possibly at the locia=0, a= ~ ,8=0, 

We already know from formula (5) that (22)G(@, p-') 
= )°8_; (a™/m!)An(v, p). The series on the right converges for all a 
and p=f-'! which are not on the exceptional loci. By Corollary II, 
1/(—1)"*r) = (A™/m!)Q,(v, This power 
series in A converges for every A, for every fixed 7, with exceptions 
as above, provided, of course, that in the summation we regard the 
terms in each Q,,(v, 7) as locked in a parenthesis, the entire poly- 
nomial serving as coefficient for A”. But now, since each coefficient 
of Q,,(v, r) is positive, we may regard the series on the right as a simple 
sequence of terms (A and r being thought of as fixed, arbitrary, real 
positive numbers) and we may certainly rearrange the terms of this 
simple sequence in any order, and reintroduce parentheses in any way 
we wish. The resulting series will converge to the same value as the 
original series. Moreover r and A may also assume imaginary values 


6 See, for example, T. J. Bromwich, An introduction to the theory of infinite series, 
London, Macmillan, 1908, articles 31 and 33. 
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and the rearrangement of terms remains justified, for the absolute 
values of the imaginary r and A are the same as for real r and A and 
the convergence of the absolute values suffices to justify rearrange- 
ment. 

An entirely analogous argument disposes of the same problem for 
the derivation of formula (16). 

It should now be noted that the rearranged series, that is, formula 
(2), is a power series in 1/R which, because it equals the original series 
for G(a, 8), certainly converges for all values of R except, perhaps, 
the few exceptional ones R=0, R=, and R=—a. As a power 
series in 1/R it must also converge then for R= ~ and it represents 
the function G(a, 8B) at R= © if we define the function to have value 0 
at that point. Formula (15) is therefore a Taylor’s expansion of 
G(a, 8) around R= © under the condition vy =m7R/b.7 


New UNIVERSITY 


7 Acknowledgment is hereby made of assistance from Dr. Leo Zippin in carrying 
out several inductive proofs of the paper. 
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ON A SIMPLE PROPERTY OF THE DERIVATIVES 
OF LEGENDRE’S POLYNOMIALS 


EMIL GROSSWALD 


It is well known that if we define, as usual, Legendre’s polynomials 
as 


1 P.. = | 

we have 

(2) P,(1) = 1 for every n. 


More generally, for every m and r, the following relation holds: 


(r) 


Ph (1) = Catenr°1-3 + (27 — 1) 
(3) 
2-4+--+2r 


As we could find nowhere a proof of (3), except in the particular case 
r =1,' we shall give here a simple one. 
From the classic relation? 
(4) P!(x) = + 
we obtain, by successive derivations, 


(r) (r—1) 


(4’) = (mtr — (x) + P(x). 


Letting x=1 in (4), we obtain, for n=yv, P}(1) =vP,_,(1)+P_,(1), so 
that, by (2), 


= » + 


letting successively y=1, 2, - - - , 2 and adding, we obtain 


(5) P,(1) = = n(n + 1)/2 


Received by the editors June 23, 1949. 

1 See Catalan, Mémoire sur les fonctions X, de Legendre, Mémoires Couronnés de 
l Academie de Belgique vol. 31 (1881). 

2 See D. Jackson, Fourier series and orthogonal polynomials, Carus Monographs, 
p. 48. 
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which proves (3) for r=1. Similarly, for r=2, we obtain from (4’), 
with x=1 and n=», 


P.’(1) = + + P4(1), 
and, by (5), 


Letting here, successively, y=2, 3, - - - , m and adding, we obtain: 
(6) PY (1) = — 1)-»-(v + 1)/2 
v=2 


= (n — 1)-n-(n + 1)(n 4+ 2)/2-4, 


proving (3) for r=2. Let us assume now that (3) holds for all deriva- 
tives up to the order r—1; then, by (4’), we obtain, for x =1 and sub- 
script pv: 


(7) = +7 — (1) + PA), 


or, using (3), valid by assumption for derivatives of order not greater 
than r—1, 


=@+r—1)4 
(7’) 2-4-++ (2r — 2) 
+ 
Writing this relation for y=r, r+1,---,n-—1, » and taking into 


account that P®, (1) =0, we obtain, adding, 


P, (1) = at! 
var 2-4---+ (2r — 2) 
2-4++-+ (2r) 


finishing the proof of (3). 

From the form of (3) it results immediately, that all values of 
P®(1) are integers. As expressions containing P’(1) appear in con- 
nection with problems of divisibility, or irreducibility of polynomials, 
it may be of some interest to remark that, as a consequence of (3), 
P* (1) are always divisible by all odd factors of (2r)!, as well as by the 
highest power of 2 which divides Car, 
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APPLICATIONS OF A LEMMA OF FEJER TO 
TYPICALLY-REAL FUNCTIONS 


M. S. ROBERTSON 


1. Introduction. Let 
(1.1) w = f(z) = 2+ ae? +--+ 


be regular for | s| <R and f(x) real for —R<x<R. f(z) is said to be 
typically-real with respect to the circle | z| = R if the imaginary part 
of f(z) has the same sign as the imaginary part of z when z is not 
real and | s| <R [1]. In particular, if f(z) is schlicht or univalent 
for | z| <R and the coefficients a,, n=2, 3, - - - , are all real, then 
f(z) is a typically-real function. If further, w=f(z) maps each circle 
| =r<R into a contour having the property that every line parallel 
to the imaginary axis cuts the contour in not more than two points, 
we say that f(z) is convex in the direction of the imaginary axis rela- 
tive to the circle |z| =R. The necessary and sufficient condition that 
f(z) be convex in the direction of the imaginary axis when it is real 
on the real axis is that z f’(z) be typically-real [5; 4]. In the discussion 
to follow we shall assume that R=1. 
We state the following lemma due to L. Fejér (Turan [2], Szasz 


[3]): 


Lemma 1. For a value of r, O0<r<1, let } a A,r" converge. In order 
that 


n=1 


it is necessary and sufficient that 


(1.3) nd,7"-sin ny = 0 for 


n=l 
It is the purpose of this paper to give some applications of Lemma 
1 to functions 


(1.4) F(z) = 


n=l 


where 


Presented to the Society, February 25, 1950; received by the editors June 27, 1949. 
1 Number in brackets refer to the references cited at the end of the paper. 
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(1.5) f(z) = anz*, g(z) = >> baz", = = 1, 
n=1 n=1 

are both typically-real with respect to the unit circle | z| =1. The same 

method of proof may be used to obtain many special results for the 

functions F(z) of (1.4). We include here the proofs of several theorems 


which illustrate the procedure. 


THEOREM 1. Jf 
(1.6) f(z) = g(z) = a, = = 1, 
are regular and typically-real for | s| <1, then 
(1.7) F(z) = >> AnDnZ”, |z| <1, 
n=1 


ts typically-real for | s| When 
f(z) = = Fe) = + 2)-(1 — 2), 
F(z) is not typically-real for | z| <p for any p>2—3"/?. 


THEOREM 2. If 


(1.8) = g(z) = ba a, = = 1, 
are regular and typically-real for | z| <1, and if 
(1.9) F(z) = 
n=1 
then 
(1.10) h(z) dz = > 3” 


ts regular and typically-real for | s| <3. 
THEOREM 3. If 
(1.11) f(z) = g(t) = =1, 
n=1 n=l 


are regular, univalent, and convex in the direction of the imaginary axis 
for |z| <1, and real on the real axis, then 


| 
| | 
| 
| 
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(1.12) F(z) = > 


ts also regular, univalent, and convex in the direction of the imginary 
axis for |z| <1. 


If in Theorem 3 we take g(z) to be the mth Cesaro partial sum of 
order 3 of the geometric series 


(1.13) 


which E. Egervary [6] has shown to be univalent and convex for 


| z| <1, we obtain immediately a new proof of the following theorem 
of L. Fejér [5]. 


THEOREM 4. (Fejér) Jf 


(1.14) fe) = cas” 


n=l 


be regular, univalent, and convex in the direction of the imaginary axis 
for |z| <1 and real on the real axis, then the Ceséro partial sums of 
order 3 of (1.14) are also univalent and convex in the direction of the 
imaginary axis for |z| <1. 


THEOREM 5. Jf 
(1.15) f(z) = Dos g(z) = Ds a, = 5; = 1, 
are regular and typically-real for | z| <1, and if 
(1.16) F(z) = |z| <1 
n=l 
then RF(re*)=0 for |@| Sarc cos O<r<1, where 4ry(r) 


=(1+34r?+r*)/2—1—r?. The inequalities are sharp. 


Theorem 5 includes a result of W. Rogosinski [1] who has shown 
that F(r)=0, 0<r<1. It is not hard to show from Theorem 5 that 
the function f(z) of this theorem satisfies the inequality 


f() 


for z=re*, r<i, |@| Sarc cos p(r*), with equality holding for 
@=arc cos u(r’), f(z) =2z(1—z)-*. Thus a specially weighted average 
of the cosine of the angle subtended at the origin by the points 


(1.17) | f(ze*) - f(z) | cos arg ap = 0, 
0 


| 
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f(z), f(ze*) is non-negative, although the cosine itself may be some- 
times negative. We omit the proof of (1.17). 


2. Proof of Theorem 1. It has been shown elsewhere [4] that the 


coefficients of the powers series for f(z) and g(z) in (1.5) have the 
representation 


i f* snas 1 * sin ny 
—f ae, tea d8(y) 
T 0 0 


sin x sin y 


where a(x), 8(y) are non-decreasing functions in the interval (0, 7) 
normalized so that 


It follows that the inequality 
(2.3) IF (re*®) = > a,b,r” sin n@ = 0 
n=1 


for 0<@<z7 for a suitable range of r can be deduced from the in- 
equality 

“. sinnx sin ny 
(2.4) 


- - -r™ sin nO = 0, 
SiNx sin y 
by term by term integrations with the aid of (2.1). However, if we 


apply Lemma 1 twice, we need only find the range of r for which 
(s=re*) 


z+ 2? 
(2.5) > n*r* sin nd = 0, 0<0<z. 
(1 = z)* n=1 


+ 2*)-(1 — 2)3} 


rsin 0{1 — 6r? + — 2(r + cos 6} 
r(1 — r)?-(1 + 4r + 1’) sin 0 
0 for0S65 7,087 2 — 


(2.6) 


IV IV 


The example f(z) =g(z) =2(1—z)-*, F(z) =(z+27)(1—z)-, shows that 
the range of r obtained above cannot be improved upon. This com- 
pletes the proof of Theorem 1. 


3. Proofs of Theorems 2 and 3. Using the notation of Theorem 2, 
we have 
ah 


(3.1) h(z) = > 


n=1 


2", <1, 


| 
- 

| 
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b 
(3.2) I{h(z)} = 


n 


n io 
r™ sin z= 


Thus J{ i h(z) } =0 for 0S@ <7, 0<r<1, follows from 


sin nx sin ny 
(3.3) sin = 0, 
a-1 sinx siny 


by term by term integrations and the aid of (2.1) and (2.2). But by 
Lemma 1, (3.3) may be replaced by 


sin ny 
(3.4) > -r™ sin = 0 for 
n=—1 Siny 
(3.5) r™ sin = T{2(1 — 23 cosy + 
siny 


Moreover, 2(1—2z cos is typically-real for | <1, 
Thus h(z) is also typically-real for | z| <1. This completes the proof 
of Theorem 2. 


Theorem 3 follows from Theorem 2. For if 


x 


(3.6) = Liew”, gz) = Do 4 =d=1, 
n=1 


n=l 


be regular, univalent, and convex in the direction of the imaginary 
axis, and real on the real axis, then 2f’(z) and 2g’(z) are typically-real 
for | s| <1. It follows from Theorem 2 that 


is typically-real for |z| <1. Thus 
(3.8) F(z) = | 
n=1 


is regular, univalent, and convex in the direction of the imaginary 
axis. 


4. Proof of Theorem 5. With the notation of Theorem 5 let 


(4.1) F(z) = Do = = 1, |2| <1. 


n=) 
With the aid of (2.1) and (2.2) it follows that the inequality 
(4.2) RF = 0, | @| < arc cos u(r), 0<r<i, 


| 
= 
\ 
| 
\ 
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can be deduced from the inequality 


(4.3) 


SiINX siny 


sin nx sin ny 


|0| arc cos u(r), 


by the method used in the preceding paragraphs. By Lemma 1, we 
need only find when 


(4.4) -r™ cos nO = 0, 
a-1 sin 

or, letting when 

(4.5) r™ cos na S 0, 
n=1 sin 

or when 

(4.6) R{(z — z°)-(1 — 2zcos@+ =< 0, z= 

Since 


— 2)-(1 — 22 cos + 2*)} 
(4.7) = r(r? — 1)[4r? costa — {1 + (6 + 4 cos? + cosa 
+ (4r + 4r*) cos ¢|, 


it will be sufficient to determine the values of a so that when 0 <r <1, 
we have 


4r? a — {1 + (6 + 4 cos? + COS 
+ (4r + cos = O. 


(4.8) 


(4.8) may also be written in the form 


(4.9) (2r cos a — 1 — r*) {27 cos? a + (1 + 7?) cos a — 4r} 
; + 4r?(1 — cos? @) cos a — (1 — cos ¢)(4r + 4r*) = 0; 

with regard to the last two terms of (4.9) we note that 

(4.10) 4r2(1 — cos? @) cosa — (1 — cos ¢)(4r + 4r*) = — 8r(1 + 7?) 


with equality occurring in (4.10) for ¢=z and for all a. Thus the 
left-hand side of inequality (4.8) is not less than 


(4.11) 4r? cos* a — (1 + 10r? + r*) cosa — 4r — 4r? 


and the left-hand side of (4.8) equals the expression in (4.11) when 
@=7. Factoring (4.11) we have to determine when 


| 
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(4.12) (1+ 2r cos a+ r*){2r cos? a — (1+ cosa — 4r} 2 0. 
(4.12) holds when, and only when, 
—4rs4rcosas — {(1 + 34r? + r?} 

— p(r)-4r. 


Thus |6| =|2—a| Sarc cos u(r) 
The estimate for the bound on @ is sharp, as is seen by taking 


f(z) =g(z) =2(1—2)~*. 


(4.13) 
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ON CONFORMAL MAPPING OF NEARLY CIRCULAR REGIONS 
S. E. WARSCHAWSKI 


1. Introduction. The following problem has repeatedly been con- 
sidered in the literature on conformal mapping. Given a simply con- 
nected region R which contains the origin and whose boundary lies 
between two concentric circles about the origin of radii 1 and 1+¢« 
(e€>0) respectively. Suppose that the function w=f(z) maps the 
circle |z| <1 conformally onto R in such a manner that f(0) =0 and 
f'(0)>0. It is desired to find: (a) an upper bound for the function 

f(z)—z| in terms of ¢ and r which holds for all z in a fixed circle 
z| <r<1, and (b) a bound which depends on € only and is valid 
(possibly under additional assumption on the boundary of R) for 
all z in the complete (open or closed) unit circle. Estimates of the 
first kind were given by L. Bieberbach [1],! A. Ostrowski [10], C. 
Carathéodory [4 and 5], M. Lavrientieff and D. Kwasselava [7], 
and M. Miiller [9]. Contributions to the second problem were made 
by L. Bieberbach [3], A. R. Marchenko [8], and J. Ferrand [6]. 

In the present paper we consider, more generally, the integral means 
of order p> 0 of the function | f(z) —3| taken along the circles | =f, 
namely 

f(z) —z}= f | f(re%) — 

0 
and those of the first and second derivatives of this function. Bounds 
in terms of € are obtained for these means which are valid uniformly 
for r<1. As an application of our estimate on the mean of the first 
derivative, we give a short proof of an inequality concerning | f(z) —s| 
in || $1 which is due to A. R. Marchenko. Furthermore, a bound 
for the modulus of the derivative f’(z)—1 for z in |z| <1 is derived. 
In all cases the estimates obtained are of the form O(e) as e—0 and 
are the best possible as far as the order of magnitude in € is con- 
cerned.’ 

The knowledge of bounds for | f(z) —z| and for some of the expres- 
sions considered in this paper is helpful in the application of certain 
iteration methods for determining the mapping function of a region 
onto a circle. For these inequalities may sometimes be used to ap- 


Presented to the Society, April 28, 1944; received by the editors March 17, 1949. 

1 The numbers in the brackets refer to the bibliography at the end of the paper. 

2 This is easily shown by simple examples such as f(s) =((z—ce)/(1—cez) +.) 
(1/(1 —ce)) where c is a sufficiently small positive constant. 
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praise the accuracy of an approximation to the mapping function; 
see, for example, L. Bieberbach [1 and 2, p. 10], and a more recent 
paper of the writer [14]. In fact, the inequalities (5.1) and (6.1) for 
pb=2 of the present note were stated and used in that paper (§9d) 
for such a purpose. However, the proofs of these statements were 
omitted there and are included in the present discussion of such 
inequalities. 

The proofs of the inequalities involving the means make use of the 
following well known theorem of M. Riesz [11] on conjugate har- 
monic functions: 

Suppose that the function f(z) =u(s)+i0(z) is regular for | s| <4 
and that u(0)=0. Then for every p>1 there exists a positive number 
A, which depends on p only, such that for OSr<i1 


(1.1) M,{u(re#)} = f < A,M, {o(re#)}. 
0 


If the right-hand side of (1.1) ts bounded for all r, OSr<\1, then 
f(re*®) has radial boundary values of class L, almost everywhere on 
|z| =1 and (1.1) holds for r=1. For p=2 all A,<2p and A2=1. 


2. The mean of the function f(z) —z. We begin with the following 
theorem. 


THEOREM I. Let R be a simply connected region in the w-plane 
which contains the origin and whose boundary is contained in the ring 


(2.1) 1<|w|<ite (e > 0). 


Suppose that the function w=f(z) maps the circle | s| <1 conformally 
onto R and that f(0)=0 and f’(0)>0. Then for every* p>1 and for 
z=re”, uniformly for 0Sr<1, 


(2.2) Mp{f(z) — 2} < n, it < (1+ A,)et-e 


where A, is the constant of the Riesz theorem.* 


3 It is clear that the inequality (2.2) if proved for p>1 remains valid, for a suitable 
constant Az», for all p, 0<p<1 (Hélder’s inequality). An analogous remark applies 
to the other estimates of the means derived in this paper, and we state and prove 
therefore Theorems I, II, III, and VI for p>1 only. 

* By use of Cauchy’s integral formula one may easily obtain from Theorem 
I a bound for |f(z)—z| for all |z|Sr<1. If t=pe#, r<p<i, |f(z)—s| 
f2*| (f(t)—t)/(t—2)| dos M, {f()—t} Me { , by Holder's inequality, where 
1/p+1/q=1. Hence, as p—1, S(1+A,)e‘e: (1—r)~/*. However, this esti- 
mate is not the best possible as far as the order of 1/(1—r) is concerned, the best 
given in the literature being log 1/(1—r) (see [5, p. 51] and [9]). 
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Proor. The function F(z) =f(z)/z for 0<|z| <1, F(0)=f'(0) is 
regular for | z| <1. Since the boundary of R is contained in the ring 
(2.1), a simple application of the principle of the maximum and mini- 
mum modulus shows that 


for |z| <1. 
Hence 
(2.3) O < log | F(z)| S for |z| <1. 


The branch of log F(z) =log | F(z)| +2 arg F(z) for which arg F(0) 
=arg f’(0)=0 is single-valued and regular for |z| <1. By the 
theorem of M. Riesz stated above we obtain therefore for z=re*, 
and all p>1: 


M, {arg F(z)} A,M,{log | F(z)| } < Ap-e. 
Hence, by use of Minkowski’s inequality, 
(2.4) M,{log F(z)} < M,{log | F(z)| } + M,fargF(z)} < (1+ 
Since for any complex number a 
Je — 1] ol 
we have (for a=log F(z)) by (2.3) 


f(2) | 


S | log ef 


and therefore, by (2.4), 
n, i} < (1+ A,)et-e. 


3. The mean of the function zf’(z)/f(z)—1. Our next aim is to 
obtain an analogous result for the derivative of the mapping function. 
This will be accomplished essentially by the following theorem. 


THEOREM II. Let C be a closed Jordan curve represented (in polar 
coordinates) by the equation p=p(o), 0S@X2r, where is positive, 
continuous, and moreover satisfies the inequality 


(3.1) | o(@ + 7) — S (for some e, 0 < 1). 
Suppose that w=f(z) maps the circle | z| <1 conformally onto the in- 


®For, |e—1| =| S| the integration being taken 
along the straight line segment 0 - - - a. 


( 
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terior of C such that f(0)=0 and f'(0)>0. Then for all p>1 for which 
€A,<1 and uniformly for 0<r<1i, 


f'(z) 1+ A, 


Proor. Just as above let F(z) be defined as f(z)/z for z¥0, and as 
f'(0) for z=0, and let log F(z) denote the branch of the logarithm 
which is real when z=0. Then F(z) and log F(z) are regular in | z| <1 
and since C is a Jordan curve, they are continuous in the closed circle 
|z| $1. Thus in particular Q(z) =arg F(z) is single-valued and con- 
tinuous in |z| <1. We define a branch of arg f(e#) by the relation 


arg f(e*) = Q(e*) + 8. 


If g(@) is a function defined for all real 6 and 4 is a positive number, 
we shall find it convenient to use the notation 


Ag(6) = g(6 + 6) — 


Since by hypothesis (3.1) log p(@) possesses bounded difference 
quotients, p’(@)/p(¢) exists almost everywhere and 


fal Stet. 


(s = re*). 


| log p(@ + 7) — log p(¢)| = 


Hence—note that | f(e#)| =p(arg f(e#))— 
| A log | f(e*)|| S | A arg f(e*) |. 
Now 
A arg f(e*) = A arg F(e*) + 5, 

and therefore, by Minkowski’s inequality, for p>1, 
(3.3) {A log | f(e*)| } S + Mp {A arg F(e*)} ]. 

By the theorem of M. Riesz, applied to log F(e*), we obtain 
(3.4) M,{A arg F(e*)} < A,M,{A log | F(e*) |}. 
Hence, from (3.3) 

log | F(e#) | } 


M,{A log | f(e*) | } 
e[5 + A,M,{A log | F(e*)| }] 


IIA 


or 


(3.5) (1 — €A,)M,{A log | F(e*)| } < ob. 


\ 
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Since (1—€A,)>0, we may divide (3.5) by 1—€A,. We also 
divide by 6 and, combining the so obtained inequality with (3.4), 
we find 


AlogF 1+ A 
(3.6) m4 = € (z = e#), 
6 1—éA, 


Since log F(z) is regular for | z| <1 and continuous for |z| $1, the 
inequality holds also when z=re” where 0<r31. Replace in (3.6) z 
by re* and keep r <1 fixed. The right-hand side of (3.6) is independent 
of 6. We may let 5-0, passing to the limit under the integral sign, 
and thus we find (3.2). 


4. Applications of Theorem II. As an application of the preceding 
result we obtain the desired estimate for the mean of the derivative. 


THEOREM III. Let C be a closed Jordan curve represenied by the equa- 
tion p=p(o), S2r, where p() ts positive, continuous, and satisfies 
the following additional conditions: For some €, 0<e<1, 


(i) 1 < p(¢) 
(ii) | + +) — p(o)| S v(o)e| 


If w=f(z) is defined as in Theorem II, then uniformly for 0Sr<1, 
p>1, «A,<1 


1+ A, 
(4.1) M,{ 1} = 2 " = re*), 
To prove this we write for 0<|s| <1 
1 
|f(@) || @) |, f@)_ || 
at 


Since hypothesis (i) implies that | f(z)/s| S=1+¢« (see §2) we obtain by 
use of Minkowski’s inequality: 


Mp -1} om, it + it 


Hence by Theorems II and I: 
2(1 + A,) 


— €A, 


— 
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Theorem III is a sharpening of the following result which was first 
stated by A. R. Marchenko [8] in 1935 and which follows easily 
from (4.1).® 


THEOREM IV. Under the hypotheses of Theorem III, there exists an 
(absolute) constant K such that for | z| <1 


(4.2) | — 2| S Ke. 


To prove this one needs only to apply the Fejér-Riesz inequality, 
which states that for any function g(z) regular for | z| Sf, 


lA 


For, if z=re‘*, 0<r<1, 


< | f’(re#) — 1|*d0 < | f’(re#) — 1|2d0 
or 
(4.3) | fe) — 1} 


by (4.1)—note that A2.=1. Now, if ¢<1/3, the factor of € in the last 
term of (4.3) does not exceed 6e'/*r'/?, so that K in (4.3) may be taken 
equal to this number. If, however, 1/3<e<1, then we make use of 
the fact that | f(z) —3| <=3<9% and choose K=9. This proves (4.2). 


5. An estimate for the derivative of the mapping function. The 
hypotheses of Theorem III are not sufficient to insure the existence 
of an inequality analogous to (4.2) for the derivative, f’(z) —1 in the 
full circle |z| <1, as under those assumptions f’(z) need not even be 
bounded in |z| <1. However, if we add a further condition (essen- 
tially involving the curvature of C) then we can prove the following: 


THEOREM V. Let C be a closed Jordan curve which satisfies hypotheses 


6 Marchenko stated his theorem without proof. Another proof of this inequality, 
essentially based on the use of the Poisson integral, is given by E. J. Specht in a 
forthcoming paper [12]. In addition, bounds in terms of ¢ for the derivatives 
--+, which are valid in |z| $1 are derived in this 
paper. Specht’s estimate for the first derivative is based on somewhat less restrictive 
assumptions than those of our Theorem V below, and yields a bound of the same 
order of magnitude in e, if applied under the hypotheses of our Theorem V. 
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(i) and (ii) of Theorem I11. Furthermore, suppose that p'(@) exists every- 
where and that’ 


(iii) <elrl, 
p p 


€ being the same as in (i) and (ii). Suppose that f(z) is defined as in 
Theorem III and that $(6) =arg f(e*). Then f'(z) exists and is continu- 
ous in |2| and 


where A and 

(5.2) —1| S <1, 
where B=A+(A—1)/e.° 


Proor oF (5.1). Under the present hypotheses the mapping func- 
tion f(z) possesses a continuous nonvanishing derivative® f’(z) for 
| z| <1. If log (f(z)/z) again denotes the branch of the logarithm 


which reduces to the real log f’(0) when z=0, then we have, for 
s=e*, 


(5.3) log = log p[$(0)] + i{ — 4}. 


Differentiation with respect to 8 leads to the relation 


1 
(5.4) iz = ]o’(6) + if — 1} 
or 


The notation (p’/p)(¢) means p’(¢)/p(¢). 

§In [14, § 9d], the hypothesis on p(¢) which corresponds to condition (i) of 
Theorem V of the present paper is (1+«)—'a<p(#) Sa(1+<«), where a is a positive 
constant. The constant A(=e4*) is given there erroneously for the case proved 
here under the assumption (i), and it should be replaced by A =¢%"4*, This is seen by 
an obvious modification of our proof of (5.1); only the relations (5.10) and (5.11) 
and the inequalities which follow from them must be changed. 

* See, for example, [13, p. 327]. 
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Since f’(e*) ~0, it follows that #0. The function 
af'(2) af’ (z) 2f'(z) 

og = | rg —— 
f(z) f(2) f(z) 


which is defined as being 0 when z=0, is single-valued and analytic 
for |z| <1, and continuous for |z| <1. If we set 


(5.6) ] og +ia 


w(¢) = — arctan — [9], 
p 


where the principal value of the arctangent is chosen, then, for z=e*, 


= w[o(6)], log log {¢ (8) sec [o(6) ]}. 


(5.7) arg 
By a well known formula for the conjugate function we have therefore 
1 T 
log {¢’(8) sec ]} -—f + 7)] — — cot— dr. 
0 


Because of hypothesis (iii) it follows easily that 


| w[o(@ + 7)] — — 7)]| +7) — —7)}. 


Hence 


| log {¢’(0) sec ]} | 
< + 1) — — cot— ar, 


or 


| log sec w [9(6)]} | 


a T 


€ 
+ — tr cot — dr. 
T 0 2 


The second term on the right-hand side of (5.8) has the value 2e log 2. 
Furthermore, since log p[¢(6)] and ¢(6)—@ are real and imaginary 
parts of log (f(z)/z) for z=e*, and the real part of this function is 


log f'(0) when z=0, it follows, again by the formula for the conjugate 
function, that the first term on the right-hand side of (5.8) is equal to 


= 
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e{log p[o(@) ] —log f’(0)}. Thus 


Shin? 
f'(0) 


By hypothesis (i) we have for | z| =1 and therefore also for 0<|z| <1 


(5.10) 1s 


(5.9) log {¢’(@) sec w[¢(6)]} | < € log 


and thus, as z—0, 
(5.11) is /f’0) Site. 


We have therefore 


og — 
f') 
Thus, we finally find from (5.9) 


S log (1+ €) <«. 


(5.12) —log sec w[o(0)] — — 2e log 2 S log + 2c log 2. 
By hypothesis (ii) 


1 1 
—log sec w[¢(0)] = — log 
2 1 7, 6) ])2 
(5.13) + ((p’/p) [o() | 
1 


The inequalities (5.12) and (5.13) establish (5.1). 
Proor oF (5.2). From (5.4), hypothesis (ii), and (5.1) we obtain 


f(z) 


A simple calculation shows that 1—1/A(1+e*?)"/?<A —1, so that, by 
(5.1), | —1| <A-—1 and therefore 
2f’(z) 


f(z) 


(5.14) 


Next we write 


lf’) s 


z | 


| 
| 
| 
J 
| 
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z |! fe) 


+f )lal+iro - 11. 


Using (5.10), (5.14), and (5.11) we find 
—1| (1+ )(4e+A—1) +6 


(5.15) 


Now the function 
-1) 
dt\ t t t \f@ 


is regular for | ¢| <1 (including t=0, if properly defined) and continu- 
ous for | ¢| <1. But for | ¢| =1 we have by (5.10) and (5.14) 


d 


(~) < (14+ 0(4e+4 — 2), 
dt 


and by the principle of the maximum modulus this holds also for 
| ¢| <1. Combining this result with (5.15) we obtain (5.2). 


6. The mean of the second derivative of the mapping function. 
The assumptions made in Theorem V permit one to go beyond the 
statement proved on the first derivative of the mapping function and 
to obtain estimates for the mean of order p of the second derivative. 


THEOREM VI. Under the hypotheses of Theorem V, $'(0) is an ab- 
solutely continuous function and for any p2=2 


(6.1) 

where A =4*e* and M,=A,A'? min (1+¢, 2!-"/”), and for p>1 

(6.2) (4 + B+ (s = re#,0 Sr <1) 


~ 


where B=A+(A—1)/e. 


PROOF OF (6.2). We consider as in equation (5.6) the branch of 
log (zf’(z)/f(z)) which is defined as 0 when z=0. Then arg (gf’(z)/f(z)) 
is harmonic for |z| <1 and continuous for |z| <1 and by (5.7) 


— 
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arg (2f’(2)/f(z)) =w[6(@)] for z=e*. Because of hypothesis (iii) 
and (5.1), w[@(6)] possesses bounded difference quotients for all @, 
so that w’ [¢(0) |’ (0) exists for almost all @ and is bounded. This implies 
that the function @ arg (zf’(z)/f(z))/00 may be represented by the 
Poisson integral for | z| =| re#| <1. For, if we write 


1 —?? 


= 
1 + 7? — 2r cos (0 — #) 


we have 


2f’(z) 
ae | = w(o(t)|K(r, @ — t)dt 


2r 


1 
w|o(t) | — K(r, — t)dt 
2x 0 ot 


1 2r 

2x 0 

and from this representation it follows that, for z=re*, 
2f’(z) } 

by hypothesis (iii) and (5.1).!° 


We apply now the theorem of M. Riesz to 2 log (zf’(z)/f(z))/00 
and find from (6.3) that, for z=re®, 0<r<1, 


f@) 


(6.4) log 


and therefore 


} S A,M,{w' < ApA™!?-e, 


(6.5) M lo < A,) 
f(z) = Ap). 


Since 


af'(z) ( f'(2) ) 
06 f') f(z) 

it follows from (6.5) and (5.14) that 
af’’(2) 
S + A,) + Be S (A+ B+ 
This proves (6.2). 


h 
| 
| 
| 
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PRrooF oF (6.1). The inequality (6.4) implies that the function 
log | sf’(2)/f(z)| for z=e* is an absolutely continuous function of @. 
Hence, by the second of the relations (5.7), ¢’(@) is an absolutely con- 
tinuous function. Furthermore lim,.; log| rf’ (re®) /f(re*)| /00 ex- 
ists for almost all @ and is equal to 

+ tan w[9(6) Jo’ ]¢’(6). 
We may pass to the limit as r—1 under the integral sign in (6.4), thus 
obtaining 


(6.6) m, + tan < A,M,{w'-¢’}. 


In order to estimate M, {o’’/o'} we write 
= (¢"/¢' + tan w-w’-¢’) — tan w-w’-¢’ 
and apply Minkowski’s inequality. Using (6.6) we find 


(6.7) + M,{ tan w-a'-¢'}, 
and by hypotheses (ii) and (iii) and (5.1) 


(6.8) m4 < A,Al + Alive? = + oe. 


We can estimate the right-hand side of (6.7) in a different way. 
First, we note that it does not exceed" 


{ f 2° | +| tan yao 
0 


1 1/p 
< f + | tan 
2x 0 


Next, using the fact that 


_ 
do 
p 
1+(7) | 
p p 
we obtain 
¢” 1 Qe 1+ | p’/p|” 
2x Jo (1 + (0’/p)*)? 


11'We are making use here of the fact that, for 220, b=0, p21, a/?+5'/? 
$2((@+b)/2)"? = 
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Now since p22 


and therefore 


Thus we find from (6.8) and (6.9) that 


< min (1 + 2-49), 


Finally, if we note again that ¢’(@) SA, we obtain (6.1). 
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